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T ECHN ICAL FIELD OF INVENTION 

; invention relates to DNA sequences, recombi- ; 



5 nant DNA molecules and processes for producing interferon 
and interferon-like polypeptides. More particularly, the 
invention relates to DNA sequences expressed in appropriate 
host organisms. The recombinant DNA molecules disclosed 
herein are characterized by DNA sequences that code for 

10 polypep"tides having an immunological or biological activity 
of human leukocyte interferon- As will be appreciated 
from the disclosure to follow, the DNA sequences, recombi- 
nant DNA molecules and processes of this invention may be 
used in the production of polypeptides useful in antiviral 

15 and antitumor or anticancer agents and methods. 

BACKGROUND ART 

In this application the interferon nomenclature 
announced in Nature, 286, p. 2421 (July 10, 1980) will be 
used. This nomenclature replaces that used in our earlier 
2 0 applications from which this application claims priority. 
E.g. , IF is now designated IFNand leukocyte interferon 
is now designated IFN-cr . 

Two classes of interferons ("IFN") are known to 
exist. Interferons of Class I are small, acid stable 
2 5 ( glyco ) -proteins that render cells resistant to viral 



infection (A. Isaacs and J. Lindenmann, "Virus Interfer- 
ence I. The Interferon", Proc. Royal Soc. Ser. B. , 147, 
pp. 258-67 (1957) and W. E. Stewart, II, The Interferon 
System , Springer-Verlag (1979) (hereinafter " The Interferon 
System " ) ) . Although to some extent cell specific ( The 
Interferon System , pp. 135-45), IFNs are not virus specific. 
Instead IFNs protect cells against a wide spectrum of 
viruses. 

Human interferons ("HuIFN" ) have been classified 
into three groups a, p and 7. HuIFN-a or leukocyte 
interferon is produced in human leukocyte cells and 
together with minor amounts of HuIFN-p (fibroblast inter- 
feron) in lymphoblastoid cells. HuIFN-cr has been purified 
to homogeneity and characterized ( e.g. M. Rubenstein - 
et al. , "Human Leukocyte Interferon: Production, Purifi- 
cation To Homogeneity And Initial Characterization" 
Proc. Natl. Acad. Sci . USA , 76, pp. 640-44 (1979)). It 
is heterogeneous in regard to size presumably because of 
the carbohydrate moiety. Two components have been 
described, one of 21000 to 22000 and the other of 15000- 
18000 molecular weight. The component of lower molecular 
weight has been reported to represent a non-glycosylated 
form. The smaller form of HuIFN-cr has also been reported 
to retain most or all of its HuIFN-a activity (W. E . 
Stewart, II et al . , "Effect Of Glycosylation Inhibitors 
On The Production And Properties Of Human Leukocyte 
Interferon", Virology , 97, pp. 473-76 (1979)). A portion 
of the amino acid sequence of HuIFN-a from lymphoblastoid 
cells and its amino acid composition have been reported 
(K.C. Zoon et al. , "Amino Terminal Sequence Of The Major 
Component Of Human Lymphoblastoid Interferon", Science , 
207, pp. 527-28 (1980) and M. Hunkapiller and L. Hood, 
personal communication (1980)) 

HuIFN-a has also been reported to exist in 
several . different forms, e.g. British patent application 
2,037,296A. These forms appear to differ from each other 
structurally and physiologically. No accepted nomenclature 
has-been adopted for these forms of HuIFN-a . Therefore, 



in this application each form will be referred to by a 
number after the general HuIFN-u designation, i.e. , 
HuIFN-al or HuIFN-or3. 

HuIFN-a may, like many human proteins, also be 
polymorphic. Therefore, cells of particular individuals 
may produce HuIFN-a species within the more general 
HuIFN-a group or forms within that group which are physio- 
logically similar but structurally slightly different 
than the group or form of which it is a part. Therefore, 
while the protein structure of an HuIFN-or may be generally 
well-defined, particular individuals may produce a HuIFN-or 
that is a slight variation thereof, this allelic variation 
probably being less severe than the difference between 
the various forms of HuIFN-a. 

HuIFN is usually not detectable in normal or 
healthy cells ( The Interferon System , pp. 55-57). Instead, 
the protein is produced as a result of the cell's exposure 
to an.IFN inducer. I FN inducers are usually viruses but 
may also be non-viral in character, such as natural or 
synthetic double-stranded RNA, intracellular microbes, 
microbial products and various chemical agents. Numerous 
attempts have been made to take advantage of these non-viral 
inducers to render human cells resistant to viral infection 
(S, Baron and F. Dianzani ( eds .) , . Texas Reports On Biology 
And Medicine , 35 ("Texas Reports" ),• pp . 528-40 (1977)). 
These attempts have not been very successful. Instead, 
use of exogenous HuIFN itself is now preferred. 

Interferon therapy against viruses and tumors 
or cancers has been conducted at varying dosage regimes 
and under several modes of administration (The Interferon 
System, pp. 305-321). For example, interferon has been 
effectively administered orally, by innoculation — 
intravenous, intramuscular, intranasal, intradermal and 
subcutaneous — , and in the form of eye drops, ointments 
and sprays. It is usually administered one to three 
times daily in dosages of 1Q 4 to 10 7 units. The extent 
of the therapy depends on the patient and the condition 
being treated. For example, virus infections are usually 
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treated by daily or twice daily doses over several days 
to two weeks and tumors and cancers are usually treated 
by daily or multiple daily doses over several months or 
years. The most effective therapy for a given patient 
5 must of course be determined by the attending physician 
who will consider such well known factors as the course 
of the disease, previous therapy, and the patient's 
response to interferon in selecting a mode of administra- 
tion and dosage regime. 
10 As an antiviral agent, HuIFN has been used to 

treat the following: respiratory infections (Texas Reports, 
pp.* 486-96); herpes simplex keratitis ( Texas Reports , 
pp. 497-500); acute hemorrhagic, conjunctivitis ( Texas R eports,, 
pp. 501-10); varicella zoster ( Texas Reports , pp. 511-15); 
15 cytomegalovirus infection ( Texas Reports , pp. 523-27); 
and hepatitis B ( Texas Reports , pp. 516-22). See also 
The Interferon System , pp. 307-19. However, large scale 
use of I FN as an antiviral agent requires larger amounts 
of I FN than heretofore have been available. 
20 HuIFN has other effects in addition to its 

antiviral action. For example, it antagonizes the effect 
of colony stimulating factor, inhibits the growth of 
hemopoietic colony- forming cells and interferes with the 
normal differentiation of granulocyte and macrophage 
25 precursors ( Texas Reports , pp. 343-49). It also inhibits 
erythroid differentiation in DMSO-treated Friend leukemia 
cells ( Texas Reports , pp. 420^28). HuIFN may also play a 
role in regulation of the immune response. Depending 
upon the dose and time of application in relation to 
30 antigen, HuIFN-a can be both immunopotentiating and 

immunosuppressive in vivo and in vitro ( Texas Re ports, 
pp. 357-69). In addition, specifically sensitized lympho- 
cytes have been observed to produce HuIFN-a after contact 
with antigen. Such antigen-induced HuIFN-a could therefore 
35 ■ be' a regulator of the immune response, affecting both 

circulating antigen levels and the expression of cellular 
immunity ( Texas Reports , pp. 370-74). HuIFN is also 
known to enhance the activity of killer lymphocytes and 
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antibody-dependent cell-mediated cytotoxicity (R. R. 
Herberman et al . , "Augmentation By Interferon Of Human 
Natural And Antibody Dependent Cell-Mediated Cytotoxicity" , 
Nature , 277, pp. 221-23 (1979); P. Beverley and D. Knight, 
"Killing Comes Naturally", Nature , 278, pp. 119-20 (1979); 
Texas Reports , pp. 375-80). Both of these species are 
probably involved in the immunological attack on tumor 
cells-. 

Therefore, in addition to its use as a human 
antiviral agent, HuIFN has potential application in anti- 
tumor and anticancer therapy ( The Interferon System, 
pp. 319-21). It is now known that IFNs affect the growth 
of many classes of tumors in many animals ( The Int erferon. 
System , pp. 292-304). They, like other antitumor agents, 
seem most effective when directed against small tumors. 
The antitumor effects of animal I FN are dependent on 
dosage and time but have been demonstrated at concentra- 
tions below toxic levels. Accordingly, numerous investi- 
gations and clinical trials have been and continue to be 
conducted into the antitumor and anticancer properties of 
IFNs. These include treatment of several malignant 
diseases such as osteosarcoma, acute myeloid leukemia, 
multiple myeloma and Hodgkin's disease (Texas Reports, 
pp. 429-35). In addition, HuIFN has recently been shown 
to cause local tumor regression when' injected into subcu- 
taneous tumoral nodules in melanoma- and breast carcinoma- 
affected patients (T. Nemo to et al. , "Human Interferons 
And Intralesional Therapy .Of Melanoma And Breast Carcinoma 
Amer. Assoc. For Cancer Research , Abs. nr. 994, p. 246 
(1979)). Although the results of these clinical tests 
are encouraging, the antitumor and anticancer applications 
of I FN have been severely hampered by lack of an adequate 
supply of purified IFN. 

Today, HuIFN-a is produced either through human 
cells grown in tissue culture or through human leukocytes 
collected from blood donors. . 2.6 x 10 9 IU of crude 
HuIFN-a have been reported from 800 1 of cultured Namalva 
cells (P, J. Bridgen et al . , supra )." At very large blood 
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centers, e. g. , the Finnish Red Cross Center in Helsinki, 
Finland, the production capacity is about 10 11 IU of 
crude HuIFN-a annually. Since dosage is typically 3 x 10 6 
IU per patient per day, these sources are not adequate to 
provide the needed commercial quantities of HuIFN-a. 
Therefore, production of HuIFN-a by other procedures is 
desirable. Because the specific activity of IFN-o is 
high, in the order of 4.0 x 10 8 to 10 9 IU/mg, the amount 
of HuIFN-a required for commercial applications is low. 
For example, 100 grams of pure HuIFN-a would provide 
between 3 and 30 million doses. 

Recent advances in molecular biology have made 
it possible to introduce the DNA coding for specific 
nonbacterial eukaryotic proteins into bacterial cells. 
In general, with DNA other than that prepared via chemical 
synthesis, the construction of such recombinant DNA 
molecules comprises the steps of producing a single- 
stranded DNA copy (cDNA) of a purified messenger RNA 
(mRNA) template for the desired protein; converting the 
cDNA to double-stranded DNA; linking the DNA to an appro- 
priate site in an appropriate cloning vehicle to form a 
recombinant DNA molecule and transforming an appropriate 
host with that recombinant DNA molecule. Such transforma- 
tion may permit the host to produce the desired protein. 

Several non-bacterial proteins and genes have 
been obtained in E. coli using recombinant DNA technology. 
These include a protein displaying rat proinsulin antigenic 
determinants (L. Villa-Komarof f et al ., "A Bacterial 
Clone Synthesizing Proinsulin 11 , Froc. Natl. Aca d. Sci. USA, 
75, pp. -3727-31 (1978)), rat growth hormone (P. H. Seeburg 
et al . , "Synthesis Of Growth Hormone By Bacteria", Nature , 
276, pp. 795-98 (1978)), mouse dihydrof olate reductase 
(A.C.Y. Chang et al . , "Phenotypic Expression In E. coli 
Of A DNA Sequence Coding For Mouse Dihydrof olate Reductase", 
Nature , 275, pp. 617-24 (1978)), human somatostatin (K. 
Itakura et al . , "Expression- In Escherichia coli Of A 
Chemically Synthesized Gene For The Hormone Somatostatin", 
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Science , 198, pp. 1056-63 (1977)); European patent appli- 
cations 0,001,929, 0,001,930, and 0,001,931 and cognate 
applications in other countries), the A and B polypeptide 
chains of"human insulin (D. V. Goeddel et al . , "Expression 
5 In Escherichia coli Of Chemically Synthesized Genes For 

Human Insulin", Proc, Natl. Acad. Sci. USA , 76, pp. 106-10 
(1979) and the European and related patent specifications, 
supra ) , antigens of human hepatitis B virus (C. J. Burrell 
et al . , "Expression In Escherichia coli : Of Hepatitis B 

10 Virus DNA Sequences Cloned In Plasmid pBR322", Nature, 
279, pp. 43-7 (1979) and M. Pasek et al . , "Hepatitis B 
Virus Genes And Their Expression In E. coli ", Nature , 
282, pp. 575^79 (1979)), human growth, hormone (D. V. 
Goeddel et al . , "Direct Expression In Escherichia coli Of 

15 A DNA Sequence Coding For Human Growth Hormone", Nature , 
281, pp. 544-51 (1979)), SV40 t antigen (T. M. Roberts 
et al . , "Synthesis Of Simian Virus 40 t Antigen In 
Escherichia coli ", Proc. Natl. Acad. Sci. USA , 76, 
pp. 5596-600 (1979)), and human fibroblast interferon 

20 (HuIFN-p) (T. Taniguchi et al . , "Construction And Identi- 
fication Of A Bacterial Plasmid Containing The Human 
Fibroblast Interferon Gene Sequence", Proc. Japan Acad. , 
55, Ser. B, pp. 464-69. (1979) together with personal 
communication 1980). 

25 None of these recombinant DNA processes, however 

is directed, as is this invention, toward the synthesis 
of HuIFN-a . This is the problem to which the present 
invention is addressed. Its solution is not facilitated 
as. were the above described recombinant DNA schemes by 

30 the availability of the sequence information required to 
prepare a synthetic gene (e.g., somatostatin) or of a 
cell type or virus rich in a particular DNA sequence 
(e.g., hepititis viral antigen) or mRNA species (e.g., 
rat insulin) which allows preparation and identification 

35 of bacterial clones containing the desired hybrid DNA, or 
of a system allowing the selection of E. coli hosts that 
express the desired protein (e.g., mouse dihydrofolate 
reductase). Neither is it aided by the report of a 



-8- 



plasmid which is said to contain a DNA sequence that 
hybridizes to a mRNA from a poly(A) RNA, that mRNA produc- 
ing HuIFN-p activity in oocytes (e. g. , fibroblast inter- 
feron). Nor is the solution of the present invention 
addressed as is the apparent suggestion of Research 
Disclosure No. 18309, pp. 361-62 (1979) to preparing pure^ 
or substantially pure HuIFN-crmRNA before cloning of the. - 
HuIFN-a gene. 

Finally, it should be recognized that the 
selection of a DNA sequence or the construction of a 
recombinant DNA molecule which hybridizes to a mRNA from . 
polyA RNA, that mRNA producing EuIFN activity in oocytes, 
is not sufficient to demonstrate that .the DNA sequence or 
the hybrid insert of the recombinant DNA molecule corre- 
sponds to HuIFN. Instead, only the production of a 
polypeptide that displays an immunological or biological . 
activity of HuIFN can actually demonstrate that the 
selected DNA sequence or constructed recombinant DNA 
molecule corresponds to HuIFN. More importantly, it is 
only after HuIFN activity is shown that the DNA sequence, 
recombinant DNA molecule or sequences related to them may 
be employed to select other sequences corresponding to 
HuIFN in accordance with this invention. 

It will therefore be appreciated from the 
foregoing that the problem of producing HuIFN-a with the 
use of recombinant DNA technology is much different than 
any of the above described processes. Here, a particular 
DNA sequence of unknown structure — that coding for the 
expression of HuIFN-a in an appropriate host — must be 
found in' and separated from a highly complex mixture of 
DNA sequences in order for it to be used in the production 
of HuIFN-a. Moreover, this location and separation 
problem is exacerbated by the predicted exceedingly low 
concentration of the desired DNA sequence in the complex 
mixture and the lack of an effective means for rapidly 
analyzing the many DNA sequences of the mixture to select 
and separate the desired sequence. 



DISCLOSURE OF THE INVENTION 



The present invention solves the problems 
referred to by locating and separating DNA sequences that 
code for the expression of HuIFN-a in an appropriate host 
and thereby providing. DNA sequences, recombinant DNA 
molecules and methods by means of which a host, is trans-- 
formed to produce a polypeptide displaying an immunological 
or biological activity of human leukocyte interferon. 

By virtue of this invention, it is possible to 
obtain polypeptide ( s ) displaying an immunological or 
biological activity of HuIFN-a for use in antiviral, 
-antitumor or anticancer agents and methods. This inven- 
tion allows the production of these polypeptides in 
amounts and by methods hitherto not' available. 

As will be appreciated from the disclosure to 
follow, the DNA sequences and recombinant DNA molecules 
of the invention are capable of directing the production, 
in an appropriate host, of a polypeptide displaying an 
immunological or biological activity of HIFN-a. Replica- 
tion of these DNA sequences and recombinant DNA molecules 
in an appropriate host also permits the production in 
large quantities of genes coding for these polypeptides. 
The molecular structure and properties of these polypep- 
tides and genes may be readily determined. The polypep- 
tides and genes are useful, either as produced in the 
host or after appropriate derivatization or modification, 
in compositions and methods for detecting and improving 
the production of these products themselves and for use 
in antiviral and antitumor or anticancer agents and 
methods . 

This .process is therefore distinguishable from 
the prior processes, above mentioned, in that this process, 
contrary to the noted prior processes, involves the 
preparation and selection of DNA sequences and recombinant 
DNA molecules which contain appropriate DNA sequences 
which code for at least one polypeptide displaying an 
immunological or biological activity of HuIFN-a. 



It will be appreciated from the foregoing that 
a basic aspect of this invention is the provision of a 
DNA sequence which is characterized in that it codes for 
a polypeptide displaying an immunological or biological 
5 activity of HuIFN and is selected from the group consisting 
of the DNA inserts of Z-pBR322 (Pst )/HcIF-4c, Z-pBR322 (Pst)/ 
HcrF-2h, Z-pBR322(Pst)/HcIF-SN35, Z-pBR322 (Pst)/HcIF-SN42 , 
Z-pKT287(Fst)/HciF-2h-AH6, DNA sequences which hybridize 
to any of the foregoing DNA inserts, DNA sequences, from 

10 whatever source obtained, including natural, synthetic or 
semi-synthetic sources, related by mutation, including 
single or multiple, base substitutions, deletions, inser- 
tions and inversions to any of the foregoing DNA sequences 
or inserts, and DNA sequences comprising sequences of 

15 codons which, on expression code for a polypeptide display- . 
ing similar immunological or biological activity to a 
polypeptide coded for on expression of the codons of any 
of the foregoing DNA sequences and inserts and that these 
sequences permit the production of interferon and inter- 

2 0 feron-like polypeptides in hosts. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic outline of one embodi- 
ment of a process of this invention for preparing a 
mixture of recombinant DNA molecules, some of which are 
25 characterized by inserted DNA sequences that code for 
polypeptides of this invention. 

Figure 2 is a schematic outline of the initial 
clone screening process of this invention. 

Figure 3 is a schematic outline of one embodi- 
30 ment of a clone screening process using DNA sequences 
prepared in accordance with the invention. 

Figure 4 is a restriction map of one of the 
clones of the invention, t & o- abs olut e - position of - 
f eStritftion site in this cluue is n o t depic 

■■ 10 dlbpliLty mor - o -absolute -p o aitions f o r -tSgSB 
bon cite&t 
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Figure 5 is a schematic outline of the process 
of determining the orientation of a DNA insert in one 
recombinant DNA molecule of this invention. 

Figur 6 displays the partial nucleotide se- 
5 quence of some cloning vehicles useful in accordance with 
this invention. 

Figure 7 displays the results of a Sephadex 
G-100 fractionation of supernatant prepared from a bac- 
terial culture of this invention* 
10 Figures 8-10 display the nucleotide sequence of 

a DNA insert to a recombinant DNA molecule of this inven- 
tion* The sequence is numbered from the nucleotide 
following the polyG 5' tail to the nucleotide before the 
polyA residues and polyC 3 ' tails . Nucleotides 57-125 
15 represent a signal sequence and nucleotides 126-626 
represent the "mature" interferon and the stop codon. 
The amino acid sequence of the signal sequence is depicted 
above its nucleotide sequence in lower case letters and 
the amino acid sequence of the "mature" inteferon is 
depicted above its nucleotide sequence in upper case 
letters. Various restriction endonuclease recognition 
sites in this gene also, depicted in Figures, 8 
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these sites being m o re dJS s olutdly boated/? than fr r 
*fe± tk m Figure 1. * 

2 5 Figure 11 is a schematic comparison of the 

restriction maps of two DNA inserts of recombinant DNA 
molecules of this invention. 

Figures 12-16 display the nucleotide sequences 
of two DNA inserts of recombinant DNA molecules of this 
30 invention. The sequences are numbered from the nucleotide 
following the polyG 5 1 tail to the nucleotide before the 
polyA residues and polyC 3 1 tails. The amino acid sequence 
of the signal sequence for each of these inserts is 
depicted above its respective nucleotide sequence in 

3 5 lower case letters and the amino acid sequence of the 

"mature" interferon is depicted above its nucleotide 
sequence in upper case letters. 



Figure 17 displays a partial restriction map of 
Z-pBR322(Pst)/HcIF-II-206 and the sequencing strategy 
employed to determine the nucleotide sequence of the 
Hif-I 1-206 fragment displayed in Figures 12-16. 

Figure 18 displays the partial restriction maps 
of a series of hybrid phages which hybridize to the 
Hif-2h fragment. 

Figure 19 displays a partial restriciton map of 
the hybrid insert of Z-pBR322Pst/HchrIF-35HBa and the 
sequencing strategy employed to determine its nucleotide 
sequence. 

• Figures 20-23 display the nucleotide sequence 
of the HchrIF-35HBof frament and the amino acid sequence 
derived from it. 

Figure 24 displays partial linkage maps for 
HuIFN-or related genes. The arrows show regions which 
form R-loops with induced leukocyte poly (A) RNA. The 
hatched box (chr-16) indicates the sequence which was 
inferred from blotting experiments, but was not revealed 
by R-loop mapping. . 

Figure 25 is a schematic outline of the construe 
tion of plasmid C8-IFN-al. 

Figure 26 is a - schematic outline of the construe 
tion of plasmid LAOAUG(a2). 

Figure 27 displays the reconstruction of the 
AUG initiation codon and the CYS initial codon in the 
construction of LAC-AUG(cr2 ) . 

Figure 28 is a schematic outline of the construe 
tion of plasmid C8-IFN-a2 and the hybrid molecules I, II, 
'ill and IV. 

Figures 29-32 display the nucleotide sequence 
and amino acid sequence encoded thereby for IFN-or4b and 
its signal sequence. 

. BEST MODE OF CARRYING OUT THE INVENTION 

In order that the invention herein described 
may be more. fully understood, the following detailed 
description is set forth. 
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In the description the following terms are 

employed: 

Nucleotide — A monomeric unit of DNA or RNA 
consisting of a sugar moiety (pentose), a phosphate, and 
a nitrogenous heterocyclic base. The base is linked to 
the sugar moiety via the glycosidic carbon (l f carbon of 
the pentose). That combination of a base and a sugar is " 
called a nucleoside. Each nucleotide is characterized by 
its base. The four DNA bases are adenine ("A")/ guanine 
("G"), cytosine ("C" ) and thymine ("T"). The four RNA 
bases are A, G, C and uracil* ("U"). 

DNA Sequence — A linear array of nucleotides 
connected one to the other by phosphodiester bonds between 
the 3 1 and 5' carbons of adjacent pentoses. 

Codon —A DNA sequence of three nucleotides (a 
triplet) which encodes through mRNA an amino acid, a 
translation start signal or a translation termination 
signal. For example, the nucleotide triplets TTA, TTG, 
CTT, CTC, CTA and CTG encode for the amino acid leucine 
("Leu"), TAG, TAA and TGA are translation stop signals 
and ATG is a translation start signal. 

Reading Frame — The grouping of codons during 
translation of mRNA into amino acid sequences. During 
translation the proper reading frame must be maintained. 
For example, the sequence GCTGGTTGTAAG may be translated 
in three reading frames or phases, each of which affords 
a different amino acid sequence: 

GCT GGT TGT AAG — Ala-Gly-Cys-Lys 

G CTG GTT GTA AG — Leu-Val-Val 

GC TGG TTG TAA G — Trp-Leu- ( STOP ) 

Polypeptide — A linear array of amino acids 
connected one to the other by peptide bonds between the 
«-amino and carboxy groups of adjacent amino acids. 

Genome — The entire DNA of a cell or a virus. 
It includes inter alia the structural genes coding for 
the polypeptides of the substance, as well as operator, 
promoter and ribosome binding and interaction sequences, 
including sequences such as the Shine-Dalgarno sequences. 
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Structural Gene — A DNA sequence which encodes 
through its template or messenger RNA ( "mRNA" ) a sequence 
of amino acids characteristic, of a specific polypeptide. 

Transcription — The process of producing mRNA 
5 from a structural gene. 

Translation — The process of producing a polypep- 
tide from mRNA. 

Expression — The process undergone by a struc- 
tural gene to produce a polypeptide. It is a combination 
10 of transcription and translation. 

Plasmid — A non-chromosomal double-stranded DNA 
sequence comprising an intact "replicon" such that the 
plasmid is replicated in a host cell. ' When the plasmid 
is placed within a unicellular organism, the characteris- 
es tics of that organism may be changed or transformed as a 
result of the DNA of the plasmid. For example, a plasmid 
carrying the gene for tetracycline resistance (Tet ) 
transforms a cell previously sensitive to tetracycline 
into one which is resistant to it. A cell transformed by 
20 a plasmid is called a " trans formant" . 

Phage or Bacteriophage — Bacterial virus many of 
which consist of DNA sequences encapsidated in a protein 
envelope or coat ("capsid"). 

Cloning Vehicle — A plasmid, phage DNA or other 
25 DNA sequences which are able to replicate in a host cell, 

which are characterized by one or a small number of 
endonuclease recognition sites at which such DNA sequences 
. may be cut in a determinable fashion without attendant 
loss of an essential biological function of the DNA, 
30 e.g. , replication, production of coat proteins or loss of 

promoter or binding sites, and which contain a marker 
suitable for' use in the identification of transformed 
cells, e.g. , tetracycline resistance or ampicillin resist- 
ance. A cloning vehicle is often called a vector. 
35 Cloning — The process of obtaining a population 

of organisms or DNA sequences derived from one such 
organism or sequence by asexual reproduction. 



Recombinant DNA Molecule or Hybrid DNA — A 
molecule consisting of segments of DNA from different 
genomes which have been joined end-to-end outside of 
living cells and have the capacity to infect some host 
cell and be maintained therein. 

Expression Control Sequence — A sequence of 
nucleotides that controls and regulates expression of • 
structural genes when operatively linked to those genes. 
They include the lac system, the trp system, major operator 
and promoter regions of phage X, the control region of fd 
coat protein and other sequences known to control the 
expression of genes of prokaryotic or eukaryotic cells 
• and. their viruses. 

Referring now to Figure 1, we have shown therein 
a schematic outline of one embodiment of a process for 
preparing a mixture of recombinant DNA molecules, some of 
which are characterized by inserted DNA sequences that 
code for polypeptides having an immunological or biolog- 
ical activity of human leukocyte interferon. 

PREPARATION OF POLY(A) RNA CONTAINING 
HUMAN INTERFERON mRNA ( IFN-amRNA) 

Human leukocytes were, induced for 5 hours at 
37 °C with Sendai virus and extracted to yield a poly (A) 
RNA mixture containing human leukocyte interferon mRNA 
("BuIFN-amRNA" ) . Induction was by the Cantell procedure 
( The Interferon System , pp.- 130-31 and the references 
. cited therein). The poly(A) RNA mixture is illustrated 
without regard to its actual proportions in Figure 1. 
Induced leukocytes were harvested and 10 11 cells were 
resuspended in 1 1 of a solution containing 8 g NaCl, 
0.2 g KC1, 1.15 g Na 2 HP0 4 *2H 2 0 and 0.2 g KH 2 P0 4 dissolved 
in 1 1 of water ("PBS") and added slowly with vigorous 
stirring to 17 1 20 mM Tris-HCl (pH 7.5), 1 mM EDTA ("TE 
buffer"), 2% sodium dodecyl sulfate ("SDS") in a 50 1 
separatory funnel. Self -digested Pronase (Calbiochem) 
was added to 200 ua/ml and the solution sti rrpd for 1 b 



at. room temperature. 10 6 counts/minute ("cpm") of 125 I- 
globin mRNA were added as a marker for recovery of the 
poly (A) RNA and to control for mRNA degradation during 
subsequent steps. 2M Tris-HCl (pH 9 ) in an amount equal 
to 1/20 of the total volume ("1/20 vol") was added and the 
mixture extracted with vigorous stirring with 15 1 of 
redistilled phenol for 10 min. Three 1 chloroform were 
added, and the mixture stirred for 5 min. After allowing 
30 min for phase separation, the aqueous phase was removed 
and extracted again with phenol and chloroform. The resul- 
tant aqueous phase, totalling 19.1 1, was combined with 
60 g SDS. Nucleic acids were precipitated from the 
aqueous phase with 1/10 vol 3M. sodium acetate (pH 5.5) . . 
and 2 vol ethanol. 

After storage overnight at -20 °C, the fibrous 
nucleic acid precipitate was removed by filtration through 
a plastic tea sieve. This material was then stirred with 
200 ml TNE (50 mM Tris-HCl (pH 7.5), 100 mM NaCl, 5 mM 
EDTA) cbntaining 0.5% SDS. It subsequently dissolved on 
addition of a further 350 ml of that solution. The 
non-fibrous precipitate was collected by centrifugation 
in 1 1 Sorvall bottles in a Sorvall RC-3 centrifuge for 
15 min at 5,000 rpm and dissolved in 350 ml TNE contain- 
ing 0.5% SDS. The two TNE solutions were combined, 
extracted 3 times with 1 vol phenol, 3 times with 1/2 vol 
ether and 3 times with 1 vol ether. RNA recovery from 
the aqueous phase totalled 775 mg, as measured by absor- 
bance at 260 nm. 

Isolation of the poly(A) RNA mixture was 
achieved by repeated batch adsorption to oligo(dT) cellu- 
lose (type 7, P-L Biochemicals, Inc.). 2.7 grams oligo(dT) 
cellulose were added to 500 ml, i.e. , about half of the 
RNA-containihg solution described above. After stirring 
for 1 h at room temperature to effect adsorption of the 
poly.(A) RNA to the oligo(dT) cellulose, the cellulose and 
the mixture of mRNAs bound to it were collected by centri- 
fugation and washed once with 50 ml TNE and a second time 
with 15 ml TNE. The bound poly (A) RNA was then eluted by 



-17- 



five successive washes with 2 ml H 2 0. The yield was 
860 pg poly (A) RNA as measured by optical density (Prepar- 
ation A). The supernatant RNA solution from the first 
adsorption was subjected to two further adsorption cycles, 
exactly as described above. The second and third adsorp- 
tions yielded 600 pg and 170 pg RNA respectively and 
were combined (Preparation B). 

RNA was assayed for HuIFN-crmRNA by injection 
into Xenopus laevis oocytes ( The Interferon System , 
pp.' 93-95): RNA was dissolved in 15 mM Tris-HCl (pH 7.5), 
88 mM NaCl ( "TNK buf fer" ) . to give a concentration of 
about 1 mg/ml. Fifty nl of this solution were injected 
into each of 50 oocytes. The oocytes were incubated 
overnight at room temperature in Bairth medium (Gurdon, 
J. Embryol and Exper. Morph. , 20, pp. 401-414 (1968) and 
Barth, J. Embryol and Exper. Morph. , 7, pp. 210-222 
(1959)). The incubated oocytes were then rinsed and 
homogenized with a Pasteur pipette in a 1.5 ml Eppendorf 
centrifuge tube in 0.5 ml 52 mM Tris glycine buffer 
(pH 8.9). The mixture was centrifuged for 2 min in an 
Eppendorf centrifuge and the supernatant was drawn off 
and frozen at -20°C for assay. IFN-cr activity was 
determined by the plague reduction assay described by 
H. Strander and K. Cantell, "Production Of Interferon By 
Human Leukocytes In Vitro " , Ann'. Med . exp ■ Fenn . , 44, 
pp. 265-73 (1966). One unit IFN-a reduces virus plagues 
by 50%. The potency of an IFN-a preparation is expressed 
relative to the human reference Hu IFN-a 69/19 (Inter- 
national Symposium on Standardization of Interferon and 
Interferon Inducers, 1969). Alternatively, the assay was 
based on the reduction of cytopathic effect, essentially 
as described by W. E. Stewart, II and S. E. Sulkin, 
"Interferon .Production In Hamsters Experimentally Infected 
With Rabies Virus", Proc. Soc. Exp. Biol. Med. , 123, 
pp. 650-3 (1966), except that human CCL-23 cells were 
used and that challenge was with* Mengo virus. The oocyte 
extracts had 300 IU of IFU-or activity per pg of RNA 
injected. In later assays incubation of injected oocytes 



was for 43 hrs and only the incubation medium was assayed 
because most of the interferon is excreted by the oocytes 
(A. Colman and J. Morser, "Export Of Proteins From Oocytes 
Of Xenopus laevis " , Cell, 17,. pp. 517-26 (1979))- For 
further purification of the poly(A) RNA sufficient 0.5 M 
ethylene diamine tetraacetic acid ("EDTA") was added to 
the poly (A) RNA Preparation A to bring the concentration 
to 5 mM EDTA. The resultant solution was extracted twice 
with an equal vol of TNE-saturated phenol and 5 times 
with an equal vol of ether. It was then passed through a 
0.1-ml Chelex-100 Bio-Rad column, heated for 90 sec- at 
100°C and layered onto a*13-ml 5-23% sucrose gradient " 
containing 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.2 M 
NaCl. 10,0-00 cpnr of 5 1 -terminally 32 P-labeled DNA frag- 
ments" produced by simultaneous digestion of pBR322 with 
restriction enzymes Hind i I I and Pst I (New England Biolabs), 
were added as size markers. Centrifugation was in an - 
SW40 rotor at 10°C and 35,000 rpm for 16 h. Fractions 
(0.6 ml) were collected with an I SCO gradient collector 
at 1 ml/min. The fractions were assayed for HuIFN-crmRNA 
as described above and their position relative to the 
32 P-DNA markers was noted for future reference. In 
subsequent centrifugations , HuIFN-amRNA-containing frac- 
tions were identified relative to the markers. The 
fractions with HuIFN-crmRNA activity contained 80 ug of 
poly(A) RNA. They were mixed with 2 vol TNE containing 
0.5% SDS and 0.02% polyvinyl sulfate (in later prepara- 
tions polyvinyl sulfate was omitted) and applied to a 
50-ul oligo(dT) cellulose column. After washing the 
column as described above, 40 pg of the RNA mixture were 
eluted with 4 washes of 0.6 ml sterile distilled water. 
After ethanol precipitation, the RNA was dissolved to 
1 mg/ml in 0.5 mM EDTA. 

An assay for HuIFN-amRNA activity was carried 
out as described above on a portion of the poly (A) RNA 
precipitate. It had a specific activity of 3600 IU 
interferon/ug of RNA injected. therefore, the sucrose 
gradient had enriched the poly (A) RNA about 10- fold in 
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regard to HuIFN-amRNA. In a subsequent, similar prepara- 
tion about a '40-fold enrichment was obtained. Prepara- 
tion B was purified similarly and, since it had a similar 
specific activity as Preparation A, the two were pooled. 

At this point it should be recognized that even 
the poly (A) RNA product obtained from the sucrose gradient 
contains a very large number of different mRNA's. Except 
for the mRNA specific for IFN-a, the other mRNAs are 
undesirable contaminants (Figure 1). Unfortunately, 
these contaminant RNAs behave similarly to HuIFN-amRNA 
throughout the remainder .of the cloning process of this 
invention. Therefore, their presence in the poly(A) RNA 
will result in the ultimate preparation of a large number 
of unwanted bacterial clones which contain genes that 
code for polypeptides other than IFN-a. This contamina- 
tion presents complex screening problems in the isolation 
of the desired IFN-a hybrid clones. In the case of . 
IFN-a, the screening problem is further exacerbated by 
the lack of a sufficiently purified sample of HuIFN-amRNA 
or DNA or portion thereof to act as a screening probe for 
the identification of the desired clones. Therefore, the 
screening process for the IFN-a clones is very time-con- 
suming and difficult.. Further, because only a very small 
percentage of IFN-a clones themselves are expected to 
express IFN-« in a biologically active or immunologically 
active form, the isolation of an active clone is a "needle 
in a haystack" screening process. 

Advantageously, we may use recombinant DNA 
technology to provide *a purified sample of HuIFN-amRNA or 
cDNA or a portion thereof. This purified mRNA or cDNA 
can be used to screen rapidly very large numbers of 
bacterial clones and thereby markedly increase the proba- 
bility of isolating a clone which expresses IFN-a in an 
active form. 
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SYNTHESIS OF cDNA MIXTURE CONTAINING HuIFN-acDNA 

The poly(A) RNA enriched for IFN-ormRNA (Prepara- 
tion A+B) was used as a template to prepare single-stranded 
complementary DNA (cDNA) (Figure 1) (Cf, A. Efstratiadis - 
et al . , "Full Length And Discrete Partial Reverse Tran- 
scripts Of Globin And Chorion mRNAs " , Cell , 4, pp. 367-78 
(1975) and references cited therein). The 800-pl reac- 
tion mixture contained 40 mM Tris-HCl- (pH 7.5), 30 mM 
NaCl, 5mM MgCl 2 , 0.5 mM DTT (Cal-Biochem) , 20 pg/ml 
oligo(dT) 12-18 (P&L Biochemicals ) , 5 mM dGTP (Schwarz), 
dCTP (Laevosan) and dTTP (Sigma), 5 mM 32 P-dATP (NEN, 
specific activity 100,000 cpm/nmole), 60 pg/ml poly (A) 
RNA and 280 units avian myeloblastosis virus (AMV) reverse 
transcriptase -(a gift from Life Sciences, Inc., 
St. Petersburg, Florida). After incubation for 1 h at 
37°C, 0.5 M EDTA and 20% SDS (recrystallized ) were added 
to 10 mM EDTA and 0.1% SDS. The mixture was extracted 
with 1 vol phenol (distilled). The phenol phase was 
washed with 20p.pl 200 mM Tris-HCl (pH 7.5), 1 mM EDTA 
and 0.1% SDS, and the aqueous phases combined. These 
were extracted with an equal vol ether (Fluka, pro anal.) 
and chromatographed on a 5-ml Sephadex G-100 column in 
TNE. Fractions of 0.1 ml were collected at 0.3 ml/min. 
Fractions displaying radioactivity (as measured by 
Cerenkov radiation) were combined and 3 M sodium acetate 
added to 0.3M. The nucleic acids were precipitated with 
2.5 vol of ethanol. After storage overnight at -20°C, 
the samples were centrifuged and the supernatant dis- 
carded. The precipitate was dissolved in 180 pi distilled 
water and transferred to a siliconized Eppendorf tube. 
20 pi 5M NaOH were added and the mixture kept at room 
temperature for 40 min. 20 pi of 5M sodium acetate, 
100 pi distilled water and 500 pi ethanol were added. 
After cooling overnight at -20°C, the resulting precipi- 
tate was collected by centrifugation at a force equivalent 
to 10,000 times the force of gravity (10000 xg) for 
20 min at 0°C. The yield of single-stranded cDNA was 
10 pg. 



• Again, it is to be understood that the single- 
stranded cDNA product prepared above is in reality a 
complex mixture of a large number of different cDNAs 
transcribed from the corresponding mRNAs present in the 
poly(A) RNA mixture (Figure 1). Only a very few of these 
cDNAs are IFN-cr related, i.e. , Hi I FN-cr cDNAs . Another 
factor also acts to complicate the cDNA mixture — each 
mRNA species of the poly(A) RNA mixture is usually not 
transcribed completely. Instead, for each mRNA species 
the transcription process may stop before the end of the 
mRNA is reached. Therefore, a large variety of cDNA 
species may be produced from each mRNA species (not shown 
in Figure 1). Each species will behave more or less 
similarly in the subsequent cloning process so that 
bacterial clones will be produced which contain recombi- 
nant DNA molecules having only a fragment of the gene for 
a particular protein. The presence of these fragment-con- 
taining clones even further complicates the final clone 
screening process. 

The sizes of the various single-stranded cDNAs 
were determined by electrophoresis of a small aliquot on 
a alkaline 2% agarose gel using 30 mM NaOH, 2 mM EDTA as 
electrolyte (M. W. McDonell et al . , "Analysis Of Restric- 
tion Fragments Of T7 DNA And Determination Of Molecular 
Weights By Electrophoresis In Neutral And Alkaline Gels 11 , 
J. Mol. Biol. , 110, pp. 119-46 (1977)). The 32 F-cDNA had 
a length of 600-1000 nucleotides, relative to single- 
stranded globin cDNA and 32 F-labeled DNA fragments used 
as size markers. 

PREPARATION OF DOUBLE -STRANDED cDNA 

The single- stranded cDNA may be rendered double- 
stranded by treatment with DNA polymerase I (T. Maniatis 
et al . , "Amplification And Characterization Of A 
p-Globin Gene Synthesized In Vitro " , Cell , 8, pp. 163- 
82 (1976)). The precipitate'd single-stranded cDNA from 
above was dissolved in 200 pi H 2 0, heated at 100°C for 



2 min and incubated in 500 pi 0.1 M heat denatured potas- 
sium phosphate buffer (pH 6.9), 10 mM MgCl 2 , 10 mM DTT 
(Calbiochem), 1 mM each of dATP (Merck), dGTP (Schwarz) 
and dCTP (Laevosan), 1 mM 3 H-dTTP (NEN, specific activity 
100,000 cpm/nmole) and 150 units/ml of E. coli DNA poly- 
merase I (Boehringer-Mannheim) After 6.5 h at 15°C, 
0.5 M EDTA and 20% SDS were added to 10 mM EDTA and 0.1% 
SDS. The mixture was then extracted with 500 pi phenol 
and the phenol phase was reextracted with 250 pi 20 mM 
Tris-HCl (pH 7.5), 5 mM EDTA ( "TE buffer"). The two 
aqueous phases were combined and chroma tographed on a 
5 -ml Sephadex G-100 column under the same conditions 
described previously. Sodium acetate (3M) was added to 
0.3 M and 2.5 vol ethanol were mixed into precipitate the 
DNA. A total of 13 pg DNA was recovered. 

The DNA was treated with nuclease S lf prepared 
by the method of L C. Wiegand et al . , "Specificity Of 
The S x Nuclease From Aspergillus ^rl^gfer^, J* Biol. Chem. , 
250, pp. 8848-55 (1975). The precipitated DNA was dis- 
solved in 250 pi S x buffer (0.2 M NaCl, 50 mM sodium 
acetate (pH 4.5), 10 mM zinc sulfate) and warmed at 37°C 
for 30 min. 1.5 pi S x enzyme (11 units/pl) were added 
and the mixture incubated at 37 °C for 30 min. SDS and 
EDTA were added to 0.1% SDS and 5 mM EDTA, and the mixture 
was extracted with 250 pi phenol. The phenol phase was 
washed with 100 ul TE buffer. The aqueous phases were 
combined and chromatographed on a Sephadex G-100 (Pharma- 
cia) column in TNE; 0.1 -ml fractions were collected at 
0.3 ml/min and the Cerenkov radiation of each fraction 
was determined. 8 pg of double-stranded cDNA were 
recovered after precipitation with ethanol and sodium 
acetate as above. 

Again, it must be recognized that the double- 
stranded cDNA produced above is a mixture of a large 
number of cDNAs and fragments thereof, only a very few of 
which are HuIFN-acDNA or its fragments (Figure 1). 
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CLONING OF DOUBLE -STRANDED DNA 

A wide variety of host/cloning vehicle combina- 
tions may be employed in cloning the double-stranded cDNA 
prepared as above described. For example, useful cloning 
vehicles may consist of segments of chromosomal, non- 
chromosomal and synthetic DNA sequences, such as various - 
known bacterial plasmids, e^, plasmids from E . coli 
including col El, pCRl, pBR322 and their derivatives, 
wider host range plasmids, g_.g., RP4, phage DNA, e^, 
the numerous derivatives of ; P hage \, e.g., NM 989, and 
vectors derived from combinations of plasmids and phage 
DNAs such as plasmids which have been modified to employ 
phage DNA or other expression control sequences or yeast 
plasmids such as the 2 p plasmid or derivatives thereof. 
Useful hosts may include bacterial hosts such as strains 
of E. coli , e.g. , E. coli HB 101, E. coli X1776, E. coli 
X2282, E. coli MRCI and strains of Pseudomonas, Bacillus 
subtilis . Bacillus stearotherm ophilus and other bacilli, 
yeasts and other fungi, animal or plant hosts such as 
animal (including human) or plant cells in culture or 
other hosts. Of course, not all host/vector combinations 
may be equally efficient. The particular selection of 
host/cloning vehicle combination may be made by those of 
skill in the art after due consideration of the principles 
set forth without departing from the scope of this 
invention. 

Furthermore, within each specific cloning 
vehicle, various sites may be selected for insertion of 
the double-stranded. DNA. These, sites are usually desig- 
nated by the restriction endonuclease which cuts them. 
For example, in pBR322 the Pstl site is located in the 
gene for B-lactamase. between the nucleotide triplets 
that code for amino acids 181 and 182 of that protein. 
This site was employed by Villa-Komarof f et al . , supra , 
in their synthesis of protein displaying rat proinsulin 
antigenic determinants. One of the two Hindll endonucle- 
ase recognition sites is between the triplets coding for 
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amino acids 101 and 102 and one of the several Tag. sites 
at the triplet coding for amino acid 45 of p-lactamase in 
PBR322. In similar fashion, the EcoRI site and the PyuII 
site in this plasmid lie outside of any coding region, 
the EcoRI site being located between the genes coding for 
resistance to tetracycline and ampicillin, respectively. 
This site was employed by Itakura et al. and Goeddel 
etal. in their recombinant synthetic schemes, supra. 
These sites are well recognized by those of skill in the 
art. It is, of course, to be understood that a cloning 
vehicle useful in this invention need not have a restric- 
tion endonuclease site for insertion of the chosen DNA 
fragment. Instead, the vehicle could be joined to the 
fragment by alternative means. 
15 The vector or cloning vehicle and in particular 

the site chosen therein for attachment of a selected DNA 
fragment to form a recombinant DNA molecule is determined 
by a variety of factors, e.g., number of sites susceptible 
to a particular restriction enzyme, size of the protein 
to be expressed, susceptibility of the desired protein to 
proteolytic degradation by host cell enzymes, contamina- 
tion of the protein to be expressed by host cell proteins 
difficult to remove during purification, expression 
characteristics, such as the location of start and stop 
codons relative to the vector sequences, and other factors 
recognized by those of skill in the art. The choice of a 
vector and an insertion site for a particular gene is 
determined by a balance of these factors, not all selec- 
tions being equally effective for a given case. 

Although several methods are known in the art 
for inserting foreign DNA into a cloning vehicle or 
vector to form a recombinant DNA molecule, the method 
preferred for a first construction in accordance with 
this invention is described in Villa-Komarof f et al. , 
35 supra , and displayed in Figure 1.. This method is charac- 

terized by digesting the plasmid (in particular pBR322) 
with that restriction enzyme specific to the site chosen 
for the insertion (in particular PstI ) and adding dGMP 
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tails to the termini by terminal transferase. dGMP tails 
are added to the 5 1 termini of the cut plasmid to regener- 
ate the Pst I site and permit linkage to a cDNA fragment 
carrying- the complementary tails. In similar fashion, 
the double-stranded cDNA is elongated by the addition of 
dCMP tails to the 3 1 termini to allow joining to the 
tailed plasmid. The tailed plasmid and cDNA are then 
annealed to insert the cDNA in the appropriate site of 
the plasmid and to circularize the hybrid DNA, the comple- 
mentary character of the tails permitting their cohesion 
(Figure 1). The resulting recombinant DNA molecule now 
carries a gene at the chosen restriction site (Figure 1). 

Of course, other known methods of inserting DNA 
sequences into cloning vehicles to form recombinant DNA 
molecules are equally useful in this invention. These 
include, for example, direct ligation, synthetic linkers, 
exonuclease and polymerase-linked repair reactions 
followed by ligation, or extension of the DNA strand with 
DNA polymerase and an appropriate single stranded template 
followed by ligation. 

It should, of course, be understood that the 
nucleotide sequences or cDNA fragment inserted at the 
selected site of the cloning vehicle may include nucleo- 
tides which are not part of the actual structural gene 
for the desired polypeptide or may 'include only a fragment 
of the complete structural gene for the desired protein. 
It is only required that whatever DNA sequence is in- 
serted, a transformed host will produce a polypeptide 
having a biological or immunological activity of HuIFN-or 
or that the DNA sequence itself is of use as a hybridiza- 
tion probe to select clones which contain DNA sequences 
useful in the production of polypeptides having an immuno- 
logical or biological activity of HuIFN-a. 

The cloning vehicle or vector containing the 
foreign gene is employed to transform a host so as to 
permit that host to express, the protein or portion thereof 
for which the hybrid DNA codes. The selection of an 
appropriate host is also controlled by a number of factors 
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recognized by. the art* These include, for example, 
compatibility with the chosen vector, toxicity of proteins 
encoded by the hybrid plasmid, ease of recovery of the 
desired protein, expression characteristics, biosafety 
5 and costs. A balance of these factors must be struck 

with the understanding that not all hosts may be equally 
effective for expression of a particular recombinant DNA . 
molecule. 

In the present synthesis, the preferred initial 

10 cloning vehicle is the bacterial plasmid pBR322 and the 
preferred initial restriction endonuclease site therein 
is the PstI site (Figure 1). The plasmid is a small 
(molecular weight approx. 2.6 megadaltons) plasmid carry- 
ing resistance genes to the antibiotics ampicillin (Amp) 

15 and tetracycline (Tet). The plasmid has been fully 
characterized (F. Bolivar et al . , "Construction And 
Characterization Of New Cloning Vehicles II. A Multi-Pur- 
pose Cloning System", Gene , pp. 95-113 (1977); J. G. 
Sutcliffe, "pBR322 Restriction Map Derived From The DNA 

20 Sequence: Accurate DNA Size Markers Up To 4361 Nucleotide 
Pairs Long", Nucleic Acids Research , 5, pp. 2721-28 
(1978)). Insertion of the DNA product in this site 
provides a large number of bacterial clones each of which 
contains one of the DNA genes or fragments thereof present 

25 in the DNA product previously prepared. Again, only a 

very few of these clones will contain the gene for IFN-a 
or fragments thereof (Figure 1). The preferred host for 
initial cloning in accordance with this invention is 
E. coli HB 101. Other experiments were conducted with 
30 " E. coli X1776, a host described in British patent 1,516,458 
and placed on deposit with the American Type Culture 
Collection, Rockville, Maryland, USA, where it has been 
assigned ATCC No. 31244. 

1. Preparation of Pstl-Cleaved, dGMP-elongated pBR322 
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Plasmid pBR322 (20 pg) was digested with 21 
units PstI endonuclease (MRE Porton Downs or New England 



Biolabs') in 150 pi 10 mM Tris-HCl (pH 7.5) , 6 mM MgCl 2 , 
50 mM NaCl, 6 mM 2-mercaptoethanol , 200 mg/pl bovine 
serum albumin ("BSA") (Calbiochem). After 2 h at 37°C, 
the mixture was extracted with 1 vol phenol-chloroform 
(1:1) and 1 vol ether and precipitated with ethanol. 

Addition of homopolymeric dGMP tails (Figure 1) 
by terminal deoxynucleotidyl transferase (TdT) (purified" 
according to F. J. Bollum, "Deoxynucleotide Polymerizing 
Enzymes From Calf Thymus Gland", in Methods in Enzymoloqy , 
(L. Grossman and K. Moldave, eds . ) , Academic Press, New 
York, 128, pp. 591-611 (1968)) was done in a 328-pl 
reaction volume containing 100 mM sodium cacodylate 
(pH 7.2), 10 mM NaH 2 P0 4 , 5 mM MgCl 2 1 mM dGTP, 50 pg/pl . 
BSA, and 3-6 units of TdT (purified as above) per pg of 
DNA. Incubation was at 37°C for 20 min. EDTA was added 
to 10 mM and the mixture extracted as above and dialyzed 
for 2 days ag'ainst TNE buffer. 

2 . Preparation of dCMP-elongated DNA 

Double-stranded DNA was elongated with dCMP 
residues by standard procedures ( E.g. , Villa-Komarof f 
et al. , supra). 150 ng of the double-stranded cDNA 
described above were incubated in 8 pi 100 mM sodium 
cacodylate (pH 7.2), 2.5 mM CoCl 2 , ' 50 pg/pl BSA, 0.1 mM. 
dCTP containing 3-6 units of purified TdT per pg of DNA 
for 8 min at 27°C and then frozen at -20°C. As before, 
the dCMP-elongated DNA is a mixture of different species, 
only a very few of which are IFN-related (Figure 1). 

3 . Preparation of Ca^-Treated E. coli X1776 

A single colony of E. coli X1776 was inoculated 
into 100 ml tryptone medium (C. Weissmann and W. Boll, 
"Reduction Of Possible Hazards In The Preparation Of 
Recombinant Plasmid DNA", Nature, 261, pp. 428-29 (1976), 
supplemented with 100 pg/ml diaminopimelic acid (Koch- 
Light Laboratories), 10 pg/ml nalidixic acid (Calbiochem) 
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and 10 ug/ml tetracycline (Achromycin , American 
Cyanamid). The culture was grown at 37°C to an apparent 
optical density of 0.6 at 650 nm (OD 650 ) (as measured in 
a Beckman DB spectrophotometer) and chilled in ice for 
30 min. The culture was then sedimented at 4000 rpm in a 
Sorvall H4 swinging bucket rotor, the cells washed with 
50 ml 10 mM NaCl, repelleted by centrifugation, and 
resuspended in 20 ml 100 mM CaCl 2 . The suspension was 
cooled in ice for 30 min, pelleted by centrifugation and 
resuspended .in 4 ml of 100 mM CaCl 2 and kept on ice 
overnight for use. E coli HB101 was prepared for trans- 
formation by the method of M. Mandel and A. Higa, 
"Calcium-Dependent Bacteriophage DNA infection", J. Mol. 
Biol- , S3, pp." 159-62 (1970). Aliguots (0.5 ml) were' 
kept frozen at -70°C and retained their activity for at 
least 3 months. 

4. Annealing of dGMP-elongated pBR322 
and dCMP -elongated DNA 

The annealing of the tailed, Pstl-cleaved 
pBR322 and tailed cDNA was as described in J. Van den 
Berg et al . , "Comparison Of Cloned Rabbit And Mouse 
p-globin Genes Showing Strong Evolutionary Divergence 
Of Two Homologous Pairs Of Introns", Nature , 276, pp. 37- 
44 (1978). 8 ng of dCMP-elongated DNA product were mixed 
with 22 ng of dGMP-elongated Pstl-cleaved pBR322 in 50 pi 
TNE buffer. Incubation was for 4 successive 1 h stages 
at 65°C, 46°C, 37°C and 20°C. 20 pi 100 mM Tris-HCl 
(pH 7.5), 100 mM CaCl 2 , 100 mM MgCl 2 and 50 pi TNE buffer 
were added and the mixture cooled in ice for 20 min. 

The product is, of course, a large mixture of 
different recombinant DNA molecules and some cloning 
vehicles without inserted DNA sequences. However, each 
recombinant DNA molecule contains a cDNA segment at the 
Pstl site. Each such cDNA- segment may comprise a gene or 
T~fragment thereof. Only a very few of the cDNA segments 
code for I FN or a portion thereof (Figure 1). The vast 



majority code for one of the other proteins or portions 
thereof whose mRNA's were part of the poly (A) RNA used in 
the process of this invention (Figure 1). 

5. Trans fection Of E. coli X1776 

With The Annealed Hybrid Plasmids 

The transfection of E. coli X1776 with the . " 
mixture of recombinant DNA molecules was as described in 
J. Van den Berg et al. , supra . P3 containment facilities 
were used for the transfection process and all subsequent 
steps in which the resulting transformed bacteria were, 
handled. The annealed pBR322 recombinant DNA molecules 
were added to 100 pi of Ca ++ -treated E. coli X1776 , . 
prepared previously, and the mixture cooled in ice for 
20 min, heated at 20 °C for 10 min, and 0.6 ml tryptone 
medium added. The mixture was plated onto; 2 tryptone 
medium agar plates supplemented as above. Transfection 
efficiency was 3.3 x 10 4 colonies per pg of annealed 
pBR322 transfecting DNA; native pBR322 gave 3 x 10 6 
colonies per pg. 

Since plasmid p3R322 includes the gene for 
tetracycline resistance, E. coli hosts which have been 
transformed with a plasmid having that gene intact will 
grow in cultures containing that antibiotic to the exclu- 
sion of those bacteria not so transformed. Therefore, 
growth in tetracycline-containing culture permits selec- 
tion of hosts transformed with a recombinant DNA molecule 
or recyclized vector. . 

After 48 h at 37°C, individual colonies were 
picked and suspended in 100 pi tryptone medium (supple- 
mented as above) in the wells of microti ter plates 
(Dynatech). After incubation at 37°C overnight, 100 pi 
40% glycerol were mixed into each well. The plates were 
stored at -20°C and a library of 100,000 individual 
clones of transformed E. coli X1776 was prepared. 

These 100,000 clones contain a variety of 
recombinant DNA molecules representing complete or partial 



copies of the mixture of mRNAs in the poly(A) RNA prepara- 
tion from iFN-producing leukocytes (Figure 2). The 
majority of these will contain only a single recombinant 
DNA molecule. Only a very few of these recombinant DNA 
molecules are related to I FN . Accordingly, the clones 
must be screened to separate the IFN-related clones from 
the others. 

SCREENING FOR A CLONE CONTAININ G HuIFN-acDNA 

There are several approaches to screen for 
bacterial clones containing human leukocyte interferon 
cDNA ( "HuIFN-acDNA" ) . These include, for example, RNA 
selection hybridization (Alwine et al., infra), differen- 
tial hybridization (T . P. St. John and R. W. Davis, 
"Isolation Of Galactose- Inducible DNA Sequences From 
Saccharomyces Cerevisiae By Differential Plague Filter 
Hybridization", Cell, 16, pp. 443-452 (1979); Hoeijmakers 
et al. , infra), hybridization with a synthetic probe 
(B. Noyes et al. , "Detection And Partial Sequence Analysis 
Of Gastrin mRNA By Using An Oligodeoxynucleotide Probe", 
Proc. Natl. Acad. Sci. USA , 76, pp. 1770-1774 (1979)) or 
screening for clones that produce the desired protein by 
immunological (L. Villa-Komarof f et al., supra) or biolog- 
ical (A. C.Y. Chang et al. , supra ) assays. We have chosen 
RNA selection hybridization as being the most convenient 
and promising method for primary screening of clones con- 
taining I FN-a cDNA . 

A. RNA Selection Hybridization Assay 

1 . overview Of The Initial Assay 

Referring now to Figure 2, recombinant DNA was 
isolated from a culture of a mixture of 512 clones from 
the above library of clones (two mixtures of 2 clones 
shown in Figure 2) (Step A). The reason for selecting 
this batch size will be explained below. The- recombinant 
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DNA molecules were cleaved, denatured and hybridized to 
leukocyte poly(A) RNA containing IFN-amRNA prepared as 
before (Step B). All recombinant DNA molecule-poly(A) 
RNA hybrids were separated from the non-hybridized poly(A) 
5 rna (Step C). The poly(A) RNA was recovered from the 

hybrids and purified (Step D). The recovered RNA was 
assayed for IFN-amRNA activity as above (Step E). If,^ 
and only if, the mixture of recombinant DNA molecules, 
contains a recombinant DNA molecule having an inserted 
10 nucleotide sequence capable of hybridizing to the IFNmRNA 

in the poly(A) RNA under stringent hybridization condi- 
tions, will the mRNA released from that hybrid cause the 
formation of IFN-or in oocytes, because mRNA released from 
any other recombinant DNA molecule-poly (A) RNA hybrid 
15 will not be I FN-a -related. If a group of 512 clones gave 

a positive response, the clones were regrouped in 8 lots 
of 64, and each lot assayed as before. This process was 
continued until a single clone responding to this assay 
was identified. 

20 There is no assurance that the recombinant DNA 

molecules and bacterial clone transformed therewith, 
which are thus identified, contain the complete IFN-acDNA 
sequence of I FN-a or even that the DNA sequence actually 
codes for IFN-a . However, the recombinant DNA molecules 

25 will certainly contain extensive nucleotide sequences 

complementary to the IFN-amRNA coding sequence. Therefore, 
the recombinant DNA molecule may at least be used as a 
•source of a probe to screen' rapidly other recombinant DNA 
molecules and clones .transformed with them to identify 

30 • further sets of clones which may contain an authentic and 

complete IFN-a nucleotide coding sequence. 

2 . Theoretical Considerations 

The conditions for the hybridization (Step B) 
are critical. The absolute concentrations and the ratio 
35 of recombinant DNA molecule and poly (A) RNA must be 

chosen so as to take into consideration reaction rate and 



stoichiometry . The proper choice is difficult to make, 
because the proportion of IFN-amRNA in the poly(A) RNA is 
.not known. In order to assure controlled and adequate 
kinetics, the hybridization was carried out under condi- 
5 tions where the concentration of DNA sequences from the 

recombinant DNA molecules was in excess as compared to 
the estimated IFN-ormRNA concentration. In a mixture of 
512 possible different recombinant DNA molecules, an • 
IFN-a-related DNA sequence ("IFN-aR DNA") will either not 
10 occur (giving a negative assay), or it will constitute at 

least about 1/512 of the recombinant DNA molecules. The 
concentration of the recombinant DNA molecule mixture and 
therefore the concentration of the IFN-aR DNA, if any, 
can thus be adjusted in the hybridization step -to ensure 
15 adequate hybridization rates. In addition, the amount of 

the IFN-aR DNA in the reaction mixture must be sufficient 
to bind enough IFN-amRNA from the poly (A) RNA to allow 
detection of IFN-cr after injection into oocytes of the 
mRNA recovered from the recombinant DNA molecule-poly (A) 
20 .. RNA hybrid. 

In order to detect IFN-a by the assays available, 
its concentration should be 100 IU/ml or higher. Because 
0.5 ml aliquots are required for replicate determinations, 
50 IU should be generated in the oocytes. The poly (A) 
25 RNA from induced leukocytes, used previously, generates 

about 500 IU IFN-a upon injection of 1 pg into oocytes. 
Therefore, at least 0.1 pg poly (A) RNA has to be injected 
to generate the needed 50 IU. Model experiments with 
rabbit globin mRNA and rabbit p-globin cDNA clones showed 
30 . * that the overall recovery of 125 I-globin mRNA in the 

oocyte relative to 125 I-globin mRNA added to the hybridi- 
zation mix was about 10%, and the recovery of mRNA activ- 
ity about 5%. Therefore, at least 0.1/0.05 = 2 pg of 
leukocyte poly (A) RNA should be used for the hybridiza- 
35 tion assay. To ensure an adequate safety margin, 12 pg 

of poly(A) RNA were used per assay, 

To calculate how much DNA from the recombinant 
DNA molecules is required to bind the IFN-amRNA in 12 pg 



of poly(A) RNA, the IFN-amRNA content of poly(A) RNA was 
estimated. On pg of poly(A) RNA generates 500 IU of IF. 
The specific activity of IFN-cr lies between 2 x 10 s and 
10 9 IU/mg protein. 500 IU of IFN-a therefore correspond 
5 to between 500/2 x 10 8 = 2.5 x 10" 6 mg (2*5 ng) and 
500/10 9 = 5 x 10" 7 mg (0.5 ng) of interferon. 



injected into an oocyte and the amount of IFN-a produced 
is unknown. In the case of p-globin mRNA, about 30 
10 molecules of protein per molecule mRNA are produced per 
hour; this value is about 6 for p-globin (JJB. Gurdon 
et al. / "Message Stability In Injected Frog Oocytes: 
Long Life Of Mammalian And p-Globin Messages", J * Mol . 
Biol. , 80, pp. 539-51 (1973 )). Assuming an average value 
15 of 20 for IFN-a , a molecular weight of 18000 for IFN-a 
* and a molecular weight of for IFN-amRNA*, then 



26 mg (18000/330000 x 20 x 24) of IFN-a should be produced 
in 2'4 h per mg of IFN-amRNA injected. If the specific 
activity of IFN-a is 2 x 10 8 /mg (2 x 10 2 IU/ng), then 
20 i ng IFN-amRNA will yield 26 x 2 x 10 2 = 5.2 x 10 3 IU of 
IFN-a. If the specific activity is 10 9 /mg (10 3 IU/ng), 
the amount of IFN-a produced would be 2.6 x 10 4 IU. 
Because 1 pg of leukocyte poly(A) RNA yields 500 IU of 
IFN-a, under the above assumed conditions, the concentra- 
25 tion of IFN-amRNA in 1 pg poly(A) RNA would fall between 

0.1 ng to 0.02 ng and the proportion of IFN-am^NA in 
^ le^ocyte poly(A) RNA would lie between lz io^OOO- and 
P 1; 50*000 - : Therefore, .12 pg of poly(A). RN^contains about 

• 1.2 ng to 0.2 ng IFN-amRNA. 
30 Should the translation ratio of the IFN-amRNA 

in the oocytes be lower by an order of magnitude than the 
average for globin mRNA, the IFN-amRNA content of the 
poly(A) RNA would be 10 times higher than calculated 
above, or between about 1:1000 to 1:5000. And, 12 pg of 
3 5 poly (A) RNA would then contain about 12 ng to 2 ng of 
IFN-amRNA. On the other hand, should the translation 
ratio of the IFN-amRNA in the oocytes be higher by an 
order of magnitude than the average for globin mRNA, the 



The relationship between the amount of IFN-amRNA 




IFN-amRNA content of the poly (A) RNA would be 10 j times 
Qjj lower tfian calculated above, or between about l:*OT^8tta- 
p and 1: 50a 1 00 0-. And, 12 \ig of poly (A) RNA wouldHhen 

contain 0.1 ng to 0.02 ng IFN-amRNA. 
5 Plasmid pBR322 has 4361 b.p. The complete cDNA 

of IFN-amRNA would add about 800-1000 b.p. to pBR322 on 
formation of pBR3 22 - 1 FN-cr cDNA to a total of about 5200- 
5400 b.p. Its' molecular weight would thus be about 12 
times (2 x 5200/800) that of the IFN-amRNA alone. There- 
ID fore, in order to bind the IFN-amRNA calculated above to 
be present in 12 pg poly (A) RNA required for the assay, 
an amount of recombinant DNA molecules equal to 12 times 
the amount of the IFN-amRNA will be required (stoichiome- 
tric amount) 

15 Because the IFN-amRNA content of the poly(A) 

RNA used to prepare the recombinant DNA molecules had 
been increased 10 to 40- fold over that of the crude 
poly(A) RNA, the group of 512 clones should have 10 to 40 
times more clones containing the desired IFN-amRNA than 

2 0 calculated from the above. 

If IFN-amRNA is 1 part in 1000 of the crude 
poly(A) RNA, then 12 pg of poly(A) RNA contain 12 ng 
IFN-amRNA and the stoichiometric amount of IFN-acDNA 
plasmid is 144 ng. Since a group of 512 clones will 

2 5 contain at least 5 with IFN-acDNA inserts, the amount of 

total hybrid plasmid DNA required is 14.8 ug (144 x 
(^.512/5 x 10~ 3 ). If IFN-amRNA is 1 part in l6*QO0 ' ; then 
12 pg of poly (A) RNA contain 1.2 ng IFN-amRNA and the 
amount of IFN-acDNA plasmid required is 14.4 ng. A group 
30 of 512 clones will contain either 0 or 1 IFN-acDNA insert, 
so that the amount of total hybrid plasmid DNA required 
is ^^pg (14.4 x 512 x 10" 3 ) . If IFN-amRNA is 1 part in 
^ then the amount of total hybrid plasmid DNA 

Required is 0.74 pg (1.44 x 512 x 10~ 3 ) . In order to 

3 5 ensure that the hybridization reaction will proceed under 

DNA excess conditions ( i.e. , excess recombinant DNA as 
compared to poly(A) RNA), 20 pg of the mixture (about 1.4 
to 30-fold excess) was chosen for the assay. 



Hybridization must be conducted under conditions 
which ensure (a) that .the hybridized portion of the 
poly (A) RNA is recovered intact and in a biologically 
active form, (b) that non-specific DNA-mRNA association 
is prevented, and (c) that the hybridization reaction 
goes to at least 75% completion. These conditions are 
most likely to be met -by hybridization in 80% formamide, 
0.4M NaCl (J. Casey and N. Davidson, "Rates Of Formation 
And Thermal Stability Of RNA : DNA And DNA: DNA Duplexes At 
High Concentrations Of Formamide", Nucleic Acids Res. , 4, 
pp. 1539-52 (1977)). In this solution, hybridization can 
be conducted at about 40°C (rather than the 60°-70°C 
required when formamide is omitted). Lower temperatures 
are preferred to minimize damage to the poly(A) RNA. We 
chose a hybridization temperature of 56°C. This is about 
3° below the T^ (J. Casey and N. Davidson, supra ) and 
about 10-13° below T^ d (Hamaguchi & Geidushek, J. Amer. 
Chem. Soc , 84, p. 13 2 29). Therefore, this temperature 
should not allow hybridization of sequences with less 
than about 87% homology, since a 1% mismatch lowers the 
T x d by 1° (T ? F. Bonner et al. , "Reduction In The Rate 
Of DNA Reassociation By Sequence Divergence", J. Mol. Biol. , 
81, pp. 123-35 (1973)). 

In the present hybridization, self -hybridization 
of DNA is not a major problem because the mixture of 
DNA 1 s being used consists of the same vector (pBR322) and 
a variety of cDNA inserts. Therefore, most of the DNA 
sequences will be heteroduplexes in which the inserts are 
available for hybridization to poly(A) RNA. It is very 
unlikely that complementary cDNA inserts which form part 
of different duplexes' will interact because of topological 
constraints. In any event, DNA : DNA reassociation is 
minimized under -the reaction conditions used (J. Casey 
and N. Davidson, supra ) . 

To determine the hybridization time required to 
ensure at least 75% reaction,, a second order rate equation 
was employed: 



Co - Ro + Ro (1 - 



In 



Ro 



(Co - Ro) 



= 3.9 h 



R = molar nucleotide concentration of hybrid- 
ized RNA 

Co = molar nucleotide concentration of initial 

DNA to be hybridized 
Ro = molar nucleotide concentration of initial 

RNA to be hybridized 
kj^ = rate constant for RNA-DNA hybridization 
t = time (sec) 

= 0.75 (75% reaction completion) 
3^ = 472 (kj^ = 1/12 Jc d (J. Casey and 
N. Davidson, supra ) 

where: k, = second order rate con- 
stant for DNA under the 
chosen conditions of 
hybridization 

and: k d = 1.7 x 10 s x L* 5 x N" 1 

(J. R. Hutton and J. G. 
We tmur , 11 Renatur ation 
Of Bacteriophage 
a X174 DNA-RNA Hy- 
brid: RNA Length 
Effect And Nucleation 
Rate Constant", J . Mol . 
Biol . , 77 , pp. 495-500 
(1973)) 
L = 900 (chain length in 
b.p.; about 900 are 
present in the full 
IFN-crcDNA insert) 
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N = 900 (complexity in b.p. 
of the hybrid chain; 
here the complexity 
is 900 because the 900 
5 nucleotides of the 

IFN-amRNA join with the 
complementary 900 
nucleotides of the 
IFN-crcDNA insert) 
10 Co = 2.5 x 10" 7 (Based on a 40 pi solution 

containing the previously 
determined 20 of recombi- 
nant DNA molecules to be 
used in the assay, again 
15 assuming that the IFN-acDNA 

insert will be 1/12 of a 
recombinant DNA molecule 
and will occur in at least 
1 of the 512 clones, and 
20 assigning 662 as the average 

molecular weight of one DNA 
base pair) 

Ro = 8.7 x 10" 8 (Based on a 40 pi solution 

containing the previously 
25 determined 12 pg of poly(A) 

RNA to be used in the 
assay, again assuming that 
the poly (A) RNA contains 
1 : 1<£hW parts IFN-amRNA 

AS 

(given the large excess of 



DNA a different proportion 
will have little effect on 
the rate of hybridization) 
and assigning 343 as the 
35 average molecular weight of 

one ribonucleotide of RNA) 



3 . Execution Of The Initial Assay 



Step A- Preparation and Cleavage of the 
Recombinant DNA Molecule Mixture 



The desired number of bacterial clones was 



inoculated onto tryp tone medium agar plates supplemented 
as above, by transferring to it an aliquot from each 
microti ter well with use of a mechanical device. After 
incubation at 37 °C, each clone had given rise to a colony 
of several mm diameter. All colonies were washed off the 
plate(s) and pooled to give an inoculum used to inoculate 
1 1 of tryptone medium supplemented as above in a 2 1 
Erlenmyer flask* The culture was shaken at 37°C to an 
apparent 0D 650 of about 0.8 (estimated visually). One 
volume of supplemented tryptone medium and chloramphenicol 
to 170 pg/ml were added to the culture which was further 
shaken at 37°C for 16 h. 20 ml chloroform were added and 
the culture shaken again for 10 min at 37°C to kill the 
bacteria (C. Weissmann and W. Boll, supra ) . The culture 
was decanted from the chloroform and the cells were 
harvested by centrifugation (Sorvall GS3 rotor) for 
15 TiHn at 6000 rpm and 4°C. About 1-2 g of cells were 
obtained for each 1-liter- preparation. The cells were 
suspended in 30 ml 20 mM Tris-HCl (pH 7.5), centrifuged 
for 20 min at 5000 rpm and 4°C (Sorvall SW rotor) and 
resuspended in 30 ml 50 mM Tris-HCl (pH 7.5). 0.25 vol 
of lysozyme solution (10 mg/ml in 50 mM Tris-HCl (pH 7.5)) 
were added and after cooling for 10 min at 0°C 0.33 vol 
(based on the vol of the original 50 mM Tris-HCl-culture 
suspension) 0.5 M EDTA (pH 8.0) were gently mixed in 
' without shaking. After another 10 min at 0°C, 1/16 vol 
(again based on the original volume) of 2% Triton X-100 
were added, After 60 min r the sample was centrifuged for 



60 ™-in at AO '000 > rpm and 0°C in a Sorvall SW rotor. The 
supernatant was transferred to a beaker containing a 
magnetic stirrer, and 3M NaQH was added with stirring 
until a pH of 12.5 was reached, as measured at 20°C, 
using a glass electrode and an Orion Research model 




601 pH meter, standardized with Beckman pH 10 Carbonate 
Buffer Standard (No. 3505). After stirring 10 min at 
20°C, the pH was adjusted to 8.5. After 3 min further* 
stirring 1/9 vol 5 M NaCl and 1 vol phenol (distilled and 
equilibrated' with 9.5 M NaCl) were added and vigorous 
stirring continued for 5 min. The phases /^J& separated 
by centrifugation (GSA Sorvall rotor) at jj^ 1 0Q0- rpm and 
0°C for 10 min. The supernatant containing Form I DNA 
(circular double-stranded DNA) was carefully removed from 
the interphase (which contains single-stranded DNA) and 
extracted 3 times with chloroform. (Phenol must be 
largely removed at this step ) . The Form I DNA fraction 
will contain those recombinant DNA molecules (pBR322-cDNA 
insert) originally used in transforming those host cells 
which form part of the 512 clones chosen for assay. 

Pancreatic RNAase A (5 mg/ml, preheated 10 min 
at 85 °C) was added to the Form I DNA to a concentration 
of 20 pg/ml and the mixture incubated 60 min at 37°C. 
1/5 vol 5 M NaCl were added and the mixture adjusted with 
30% polyethylene glycol 6000 (Union Carbide, autoclaved 
20 min at 120°C) up to a final concentration of 7.5% PEG. 
After 2-16 h at -10 °C, the precipitate was collected in a 
Sorvall SW Rotor for 20 min at SMuTOQ rpm and 0°C, dis- 
solved in 0.075 M NaCl, 0.007^M Na-citrate to an absor- 
bance of 20 at 260 nm f and adjusted to 0.5% SDS. The 
solution was incubated for 30 min at 37 °C with 0.5 mg/ml 
Pronase ( self -digested at 20 mg/ml, 2h at 37° C) and 
extracted 3 times with 1 vol distilled phenol and 2 times 
with 1 vol chloroform. The sample (up to 2 ml of a 
1 mg/ml DNA solution) was centrifuged through a 5 to 23% 
sucrose gradient ^n 50 mM Tris-HCl (pH 7.5), 1 mM EDTA 
for 15 h at ■ M^C&O . rpm and 15°C using an SW 27 Beckman 
Rotor. Fractions were collected and the OD 2 eo monitored. 
DNA-containing fractions were pooled and the DNA precipi- 
tated with sodium acetate and ethanol. 20 to 100 pg of 
the Form I DNA. mixture were recovered by centrifugation. 

Twenty pg of purified Form I DNA were digested 
in 150 pi 10 mM Tris-HCl (pH 7.5), 6 mM MgCl 2 , 50 mM 
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NaCl, 6 mM 2-mercaptoethanol , 200 pg/ml BSA or gelatin 
and 20 units Hind i 1 1 (New England Biolabs). The Hind i 1 1 
restriction enzyme cleaves the Form I DNA at a site 
within the pBR322 moiety (It is unlikely that the cDNA 
moiety is also cleaved, but if it is, the assay should 
not be substantially affected). After 2"h at 37°C, an 
aliquot (1%) was analyzed by electrophoresis through a 1% 
agarose gel in 50 mM Tris-acetate (pH 7.8), 2 mM EDTA for 
1 h at 50 mA to ascertain whether digestion was complete. 
If digestion was not complete, more Hind i I I was added and 
incubation continued for 2 h. When the Form I DNA was 
converted totally to linear molecules, Pronase 
(Calbiochem) ,. EDTA and SDS were added to 0.5 mg/ml, 10 niM 
and 0.5% respectively. After 30 min at 37°C, the solution 
was extracted with 30 yil phenol-chloroform (1:1). The 
organic phase was washed with 50 pi 20 mM Tris-HCl 
(pH 7.5), 1 mM EDTA, and the combined aqueous phases 
extracted 3 times with ether, filtered through a 0.1-ml 
Chelex cblumn, collected in an EDTA-boiled Pyrex tube 
and precipitated with 1/10 vol 3M sodium acetate and 
2.5 vol ethanol. After standing overnight at -20°C, the 
DNA was collected by centrifugation. 

Step B - Hybridization of the DNA with Poly(A) 
RNA 

Two hybridization mixtures were prepared. 
Mixture I contained 4 pi of 10-fold concentrated hybridi- 
zation buffer (4M NaCl, 0\ 1 PIPES (pH 6.4, 1,4 piperazine- 
■ diethane sulfonic acid, Sigma), 50 mM EDTA, 0.5 pi (about 
5 ng 125 I-globin mRNA (5000 cpm) and 6 pi induced leuko- 
cyte poly (A) RNA (2 pg/pl), an amount sufficient to 
generate 6000 IU of I FN when injected into oocytes. 
Mixture II contained 10 pg of the Hind i I I digested Form I 
DNA from above and 0.1 pg of PstI -digested 2-pBR322 (H3 )/ 
RcpG-4.13 (apBR322 derivative that contains the p-globin 
sequence in the Hindi I I site) (Mantei et al . , "Rabbit 
p-globin mRNA Production In Mouse L Cells Transformed 
With Cloned Rabbit p-globin Chromosomal DNA", Nature , 
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281, pp. 40-46 (1979).). The 125 I-globin mRNA in mixture 
I and the p-globin DNA in mixture II serve as internal 
positive controls for the hybridization assay. Both 
mixtures were dried in a stream of nitrogen gas. 40 pi 
of 80% formamide were added to the residue of mixture II 
and the solution was denatured for 10 min at 100 °C and 
chilled quickly in ice. The denatured solution was used - 
to dissolve the residue of mixture I and the resulting 
solution incubated at 56°C for 4 h. 

Step C - Separation Of Hybridized Poly (A) 

RNA-DNA From Non-Hybridized Poly(A) 
RNA 

After dilution to 1 ml with cold 0.9 M NaCl, 
0.09 M Na-citrate and formamide (100%) to 4% (by volume) 
the solution was filtered at 0.5 ml/min through a Milii- 
pore filter (0.45 pm pore size), the filter having been 
first tested for its capacity to retain RNA-DNA hybrids, 
because not all filters obtained from the manufacturer 
were equally efficient. 

Step D - Purification Of Hybridized Poly(A) 
RNA 

The above filter, with pc-ly(A) RNA hybrids 
attached, was immersed in 1 ml 0.15 M NaCl, 0.015 M 
Na-citrate, 0.5% SDS for 10 min at 37°C, rinsed with 
50 mM Tris-HCl (pH 7.5), 10 mM MgCl 2 , 2 mM CaCl 2 and 
placed in 0.6 ml of fresh buffer. After the addition of 
5 pi iodoacetate-treated DNAase (5mg/ml) (S. B. Zimmermann 
and G. Sandeen, Anal. Biochem. , 14, p. 269 (1966); P. A. 
Price et al. , "Alkylation Of A Histidine Residue At The 
Active Site Of Bovine Pancreatic Deoxyribonuclease" , 
J. Biol. Chem. , 244, pp. 924-32 (1969)), the filter was 
incubated at 37°C for 10 min. 

The filter was removed and the solution extract- 
ed with 1' vol phenol and 1 vol ether and passed through a 
. 0.1-ml Chelex column. 5 pg of carrier RNA (purified 



yeast RNA) were added to the solution and the RNA precip- 
itated with sodium acetate and ethaiwl^ The precipitate 
was collected by centrifugation at - £0*000 ' xg, dissolved 
in 100 pi 1 mM EDTA, heated for 9<fsec at 100°C, and TNE 
and SDS added to 2 x TNE and 0.5% SDS. The RNA was 
adsorbed to a 100-pl oligo(dT) cellulose column, eluted 
with four washes of 0.3 ml distilled water and precip- 
itated with sodium acetate and ethanol. After 16 h at 
-20 °C the precipitated- RNA was separated by centrifugation 
and dissolved in 2 pi TNK buffer. 

Step E - Determination Of IFN-amRNA Activity 

The poly(A) RNA solution from above was injected 
into 40 oocytes (about 50 nl per oocyte). The oocytes 
were incubated at 23 °C for 24-48 hours, homogenized and 
centrifuged (or the incubation medium recovered) and 
assayed as described previously for IFN-cr . 

4. Subsequent Assay - Hybridization 
To Filter-Bound DNA 

Most subsequent assays of a recombinant DNA 
molecule from a single clone were carried out with DBM or 
DPT paper-bound DNA, because the assay conditions were no 
longer critical and the assay is more convenient. DPT 
paper gave lower backgrounds and was used preferentially. 
DBM paper was prepared as described (J. C. Alwine et al . , 
"Method For Detection Of Specific RNAs In Agarose Gels By 
Transfer To Diazobenzyl Oxymethyl-Paper And Hybridization 
With DNA Probes", Proc. Natl. Acad. Sci. USA , 14, 
pp. 5350-54 (1977)). APT paper was prepared by a proce- 
dure of B Seed (pers. commun.): Sheets of Whatman 540 
paper (20 g) were agitated for 16 h at 20°C with a mixture 
of 70 ml 0.5 M NaOH, 2 mg/ml NaBH 4 and 30 ml 1,4-buta- 
nediol diglycidyl ether. The paper was then transferred 
to a solution of 10 ml 2-aminothiophenol in 40 ml acetone 
and agitated for 10 h. The paper was exhaustively washed 
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with acetone, 0.1 N HCl, H 2 0, 0.1 N HCl, H 2 0 and dried. 
APT paper was diazotized to DPT paper as described for 
the conversion of ABM to DBM paper (Alwine et al . , , supra ) . 

DNA (up to 15 pg) was bound to 50 mm 2 
diazotized ABM (DBM) or diazotized APT (DPT) paper as 
described by J.H.J. Hoeijmakers et al. "The Isolation Of 
Plasmids Containing DNA Complementary To Messenger RNA . 
For Variant Surface Glycoproteins Of Trypanosoma Brucei", 
Gene , in press, 1980) and set forth below. 

Hybrid plasmid DNA was digested with endonucle- 
ase Pst I , treated with SOD pg Pronase per ml, 0.5% SDS, 
and 10 mM EDTA for 30miri at 37°C, extracted with phenol 
and ether, passed through a 0.1-ml Chelex column, and 
precipitated with ethanol. The heat-denatured DNA (up to 
5 pg, with a small amount of 32 P-DNA added as tracer) was 
incubated overnight at 0°C with .1 cm 2 DBM or DPT paper in 
200 pi 25 mM potassium phosphate buffer (pH 6.5). Filters 
were washed three times for 5 min at room temperature 
with 50 mM potassium phosphate buffer (pH 6.5), 1% glycine 
and three times with 99% recrystallized formamide. A 
further incubation with 99% formamide for 2 min at 68°C 
was followed by three washes in 50 mM potassium phosphate 
buffer (pH 6.5) at 20°C and two washes in 0.4 M NaOH at 
37°C for 10 min. About 40-60% of the radioactivity was 
retained on the filters. The filters were incubated for 
3 h at 38 °C in pre-hybridization medium A, supplemented 
with 1% glycine, using 330 pi per filter. Medium A 
contains 50% formamide, 5 x SSC, 0.04% polyvinyl, pyrroli- 
done, 0.04% Ficoll (Pharmacia), 0.1% SDS, 25 pg. poly(A) 
(P & L) and 100 pg yeast RNA (BDH, extracted six times 
with phenol and precipitated with ethanol). The filters 
were washed twice in medium A and then hybridized for 
16 h at 38°C with poly(A) RNA as indicated (usually 
5-8 pg) in medium A under paraffin oil. The RNA was 
added as follows: one wet DNA filter was blotted and put 
in a sterile Petri dish, 20-40 pi of the RNA solution 
were pipetted on this filter and a second DNA filter 
(either a duplicate or a control) was put on top and the 



sandwich was covered with a sterile paraffin oil. After 
the hybridization the filters were successively washed in 
medium A (2 times), in a solution containing 1 x SSC, 
, 0.2% SDS, 1 mM EDTA (3 times, 10 min at 20°C each), 
5 medium A (2 h at 38°C) and in 50% formamide, 5 x SSC, 
0.1% SDS (3 times, 10 min at 20°C). Hybridized RNA was 
eluted by heating for 1 min at 100°C in 200 pi 10 mM 
Tris-HCl (pH 7.4), 1 mM EDTA and 0.1% SDS. The elution 
step was repeated twice, the eluates were combined and 
10 the RNA was precipitated with ethanol after addition of 
2 pg yeast RNA (purified as above). The washed pellet 
was vacuum dried, dissolved in 3 pi H 2 0 and injected into 
oocytes. IFN-a activity was assayed as above. 

5 . Results Of The RNA Selection 
15 Hybridization Assay 

The assays from 8 groups of 512 clones ( i.e. , 
groups T, Y, j, K, * , O, e and n were negative.. 
The assays from 4 groups of 512 clones ( i.e. , groups I, 6, 
N and k) were positive, albeit not consistently. The 
2 0 positive assays are reported in the following format: 

IU/ml of IFN-a produced by the RNA released from poly (A) 
RNA-DNA hybrid (assay from control hybridization using 
2-pBR322(H3 )/Rc3 G-4.13, supra ) ; the assays in which 
the experimental results were higher than the background 

2 5 control are underscored. 

Group IU/ml 

I <60 (<60); 110 (<20); <110 (<110); <110 (<110); 

<35 (<35) 
6 20 (<20) 

30 N 35 (<20); <110 (<110); 200 (<110) 

k <60 (<60); 60 (<20); <110 (<110); <110 (<110) 

Group k was subdivided into 8 subgroups of 64 clones 

and hybridized and assayed as before. The subgroups gave 

the following results, presented in the same format as 

3 5 above: 



-45- 



Subgroup 



lU/ml 



10 



fx 


_ I 


<35 


( <35 ) - 


<35 (<35) 


f\ 


- IT 

X J. 


130 


( <30 ) • 


<45 (<45) 


A 


T T T 
- Ill 












<20 


(<20) 




A 


- IV 


85 


(<35); 


<25 (<25) 


A 


- V 


<35 


<<35) 




A 


- VI 


<35 


(<35) 




X 


- VII 


<35 


(<35) 




A 


- VIII 


<35 


(<35) 





Subgroup A-III was subdivided, into 8 sets of 8 clones,. 



15 



20 



and hybridized 


and assayed: 


Set 


IU/ml 


A-III-1 


<20 (<20); <20 (6.0); 35 (<30) 


A-III-2 


<35 (<35); <30 (<30); 150 (<20); 




(<35); 110 (60) 


A-III-3 


<25 (<25); <30 (<30) 


A-III-4 


30 (<30); <20 (<20); <20 (60) 


A-III-5 


30(?) (<35); <20 (<20); <35 (60) 


A-III-6 


<30 (<30); <20 (<20) 


A-III-7 


<30 (<20) 


A-III-8 


<30 (<20) 



Because the first positive result was achieved with the 
set A-III-4, the individual colonies of this set (des- 
ignated A to H) were hybridized and assayed: 

A-III-4-B <35* (<35); <20 (60) 

X-IU-4-C 35 (60); 60* (<35); 111* ( < 11 ) ; 11* (<H); 
20 (<20) 

Therefore, clone A-III-4-C contains a recombinant DNA 
molecule capable of hybridizing IFN-amRNA. 



* The DBM paper method was used in this assay. 



The recombinant DNA molecule in this clone is 
designated: Z-pBR322 (Pst )/HcIF-4C ("Hif-4C"), and the 
bacterial strain containing it: E. coli X1776 (Z-pBR322 
(Pst)/HcIF-4C) ( " E. coli Hif-:4C" ) ■ This nomenclature 
indicates that the recombinant DNA molecule originated in 
Zurich (Z) and is plasmid pBR322 containing at the PstI 
site a HIFN-acDNA ( "HcIF" ) ; the particular recombinant 
DNA molecule being derived from clone \-III-4-C ("4C"). 

RECLONING AND CHARACTERIZATION OF Z-pBR322 (Pst )/HcIF-4C 

Since primary clones of transformed cells 
occasionally contain more than one species of recombinant 
DNA molecule ( Ef strati adis et al . ' , "The Primary Structure 
Of Rabbit g-globin mRNA As Determined From Cloned 
DNA ,f , Cell , 10, pp. 571-85 (1977)), Hif-4C was isolated 
from E. coli 'X!776 (Hif-4C) clones and purified as . * 
described above. Samples of Hif-4C and pBR322 were 
digested with Pst I" and analyzed by electrophoresis on a 
1% agarose gel. Hif-4C gave two bands, one with the 
mobility of Pst -cleaved pBR322, the other with a mobility 
corresponding to about 320 b.p. 

E. coli HB101 was transformed with the isolated 
Hif-4C as described above. Six clones of tetracycline- 
resistant, transformed bacteria were picked, small cul- 
tures prepared and Form I DNA purified and analyzed by 
Pst I cleavage and agarose gel electrophoresis as before. 
All samples showed cleavage patterns identical to Hif-4C. 
One of these recloned' -recombinant DNA molecules was 
designated Z-pBR322.(Pst )/HcIF-4c ("Hif-4c") and used for 
further experimentation. The lower case "c" designates a 
recloned DNA molecule. 

To determine the capacity of Hif-4c and its 
cDNA insert to hybridize to IFN-amRNA, Hif-4c (115 pg) was 
digested to completion with 125 units of PstI, extracted 
with phenol and chloroform/ and precipitated with ethanol 
as described above. An aliquot (10 pg) was 5 1 terminally 
labeled (to serve as a tracer in subsequent steps) by 
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dissolving it in 100 pi 50 mM Tris-HCl (pH 7.5), passing 
it through a 0.1-ml Chelex 100 column and treating it 
with 0.6 units bacterial alkaline phosphatase for 1 h at 
65°C. Tenfold concentrated TNE (40 pi) was added and the 
solution extracted 3 times with 1 vol phenol and 3 times 
with 1 vol chloroform. The DNA was precipitated with 
2 vol ethanol at -20°C overnight and collected by centrifu 
gation. For further purification, the sample in 0.5 ml 
TNA was adsorbed to 0.25-ml DEAE cellulose (Whatman DE52, 
prewashed with 2 ml 150 mM NaCl, 50 mM Tris-HCl (pH 7.5), 
2 mM EDTA) ( "NET-buf f er 1 ' ) , washed with 2 ml of NET buffer, 
eluted with 0.4 ml 1.5 M NaCl , 20 mM Tris-HCl (pH 7.5)*, 
2 mM EDTA and precipitated with ethanol as above. The 
DNA was incubated with v - 32 *-ATF (specific activity *" 
about 5000 Ci/mmole) and polynucleotide kinase, (A.M. 
Maxam and W. Gilbert, "A New Method For Sequencing DNA", 
Proc. Natl. Acad. Sci . USA , 74, pp. 560-564 (1977)) and 
purified by chromatography on a 3 -ml Sephadex-G50 column 
in TNE. The eluted fractions were pooled and the 32 P-DNA 
precipitated with ethanol as above; yield, about 10 7 dpm. 

The unlabeled Pstl-cleaved Hif-4c DNA (90 pg) 
was mixed with 6 x 10 5 dpm of "P-labeled PstI cleaved 
Hif-4c DNA from above and electrophoresed through a 
10 x 20 x 0.7 cm, 2% horizontal agarose gel in 50 mM 
Tris-acetate buffer (pH 7.8) using a 2.5 cm slot. An 
x-ray film was exposed to the gel and the position of the 
320-bp fragment determined. The gel strip containing the 
radioactive band (1.3 x 10 s dpm) was cut out, crushed- by 
pressing through a plastic 2-ml syringe and extracted 
overnight at 4°C by agitation with ten times the gel vol 
of NET buffer. The DNA was adsorbed to a 0.1-ml hydroxy- 
apatite column (prewashed with 1 ml NET buffer). The 
column was washed with 1 ml 0.1 M K-phosphate buffer 
(pH 7.5) and the DNA eluted with 0.2 ml 1 M K-phosphate 
buffer (pH 7.5). The eluate was diluted 10- fold with 
sterile distilled H 2 0 and the DNA adsorbed to and eluted 
from DEAE and precipitated with ethanol as described 
above. This DNA is called "Hif-4c fragment". 



The Hif-4c fragment (120 ng) was bound to DPT 
paper (0.5 x 0.5 cm) as described above. As a control, 
120 ng p-globin cDNA fragment excised with Hind i 1 1 from 
the hybrid plasmid Z-pBR322 (H3 )RcpG-4 . 13 (F. Meyer 
et al . , "Transposition Of AT-linked, Cloned DNA From One . 
Vector To Another" , Experimentia , 35, p. 972 (1979); 
N. Mantei et al . , supra ) and processed similarly. Hybrid- 
ization of duplicate filters to poly(A) RNA (in 20 pi)., 
washing of the filters and recovery of the RNA from the 
filters were as described above. After injection into 
oocytes the following IFN-cr activities were detected: 

IFN-a activity 
amount of time of (IU/ml)** 

• leukocyte poly(A) hybridi-- (duplicate 



DNA fragment 


RNA* 




zation 


assay) 


Hif-4c 


2 


.5 


16 h 


250;100 


P-globin cDNA 


2 


.5 


16 h 


4;1 ; . 


Hif-4c 


7 


.5 


16 h 


3000;1000 


p-globin cDNA 


7 


.5 


16 h 


4;30 


Hif-4c 


7 


.5 


5 h 


1000;1000 


P- glob in cDNA 


7 


.5 


5 h 


10 ;1 



Thus, Hif-4c contains an insert capable of hybridizing to 
IFN-ormRNA . 

IDENTIFICATION OF CLONES OF E. COLI CONTAINING 
RECOMBINANT DNA MOLECULES CROSS -HYBRID I ZING TO 
. THE INSERT IN Hif-4c 

Since the cDKA insert in recombinant DNA mole- 
cule Hif-4c was only about 320 b.p., or a third of the 
estimated size of IFN-ormRNA, the purified Hif-4c fragment 



* 1 pg of this RNA gave 460b IU/ml. 

** Oocyte supernatant after 48 h incubation, assayed by 
cytopathic effect reduction (W.E. Stewart, II and S.E. 
Sulkin, supra ) . 



described above was used as a probe to screen for bacter- 
ial clones containing recombinant DNA molecules having 
related hybrid DNA inserts (Figure 3). 

The 64 bacterial clones constituting subgroup 
A-III described above were stamped onto a Millipore 
membrane (8 cm diameter), placed on an agar plate (supple- 
mented with diaminopimelic acid, nalidixic acid and 
tetracycline, as above) and incubated for 24 h at 37 P C. 
The filter was placed onto a 0.75 ml- drop of 0.5 M NaOE 
and after 2-3 min transferred onto a paper towel to 
remove excess liquid; the step was repeated. The filter 
was neutralized, using 1 M Tris-HCl (pH 7.5), and washed 
with 1.5 M NaCl - 0.5 M Tris-HCl (pH 7.4) in a similar 
fashion as above and air dried. . The filter was dipped in. 
0.3 M NaCl, air dried and heated at 80°C for 2 h in a 
vacuum . 

Hif-4c Pst fragment (30 ng) was 32 P-labeled by 
nick translation (A.J. Jeffreys and R.A. Flavell, "The 
Rabbit p-Globin Gene Contains A Large Insert In The 
Coding Sequence" , Cell , 12, pp. 1097-1108 (1977)) using 
3- 32 P dATP and a- 32 P dCTP (specific activity, 40 Ci/mmole 
each). The filter bearing the \-III colonies was -prehy- 
bridized in 4 x SET (SET is 0.15 M NaCl, 30 mM Tris-HCl 
(pH 8.0), 1 mM EDTA), 0.1% (w/v) Ficoll,- 0.1% polyvinylpyr- 
rolidine, 0.1% (w/v) BSA, 0.5% "SDS, and 200 pg/ml denatured, 
fragmented salmon sperm DNA for 7 h at 68 °C and hybridized 
with 2 x 10 5 cpm of 32 P-la±>eled Hif-4c fragment in 4 x SET, 
0.02% (w/v) Ficoll, 0.02% polyvinylpyrrolidine, 0.02% w/v 
BSA, 0.5% SDS and 200 pg/ml denatured salmon sperm DNA at 
68 °C for 16 h. The filter was rinsed with SET-0.5% SDS 
at room temperature, washed with 2 x SET - 0.5% SDS for 
5 h at 68 °C, replacing the solution once, and with 3 mM 
Trizma base -.at room temperature for 4 h, replacing the 
solution once. After drying the filter, an x-ray film 
was exposed to the filter for 80 h using a screen. Three 
colonies gave a strong positive response, namely X-III-7D, 
A-III-2H and X-III-4C, and 2 colonies a weak one, namely 
A-III-1E, A-III-3D. 
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Small cultures were prepared from the Hif-4c 
related clones, Form I DNA was purified, cleaved with 
PstI and analyzed by agarose gel electrophoresis as 
described above. All Form . I DNAs gave rise to a large 
fragment (plasmid pBR322 moiety.) and a small one (hybrid • 
insert). The recombinant DNA molecule from X-III-2H 
released the largest insert, namely about 900 b.p. This 
recombinant DNA molecule was designated Z-pBR322 (Pst )/ 
HcIF-2H ( u Hif-2H") and its insert "Hif-2H fragment". 

Hif-2H was tested for its capacity to bind 
IFN-ormRNA by binding it to DPT paper (4 pg/100mm 2 ) and 
hybridizing it to poly (A) RNA (0.3 pg/pl), all as 
described above, for 16 h and determining IFN-amRNA activity 

DNA sample IFN-or activity (IU/ml)* 

Hif-2H 250 + 50 (average of 4 

determinations ) 
Z-pBR322(H3 )/RcpG-4.13 30 (average of 2 determina- 

tions) 

pBR322 20 

Hif-2H was recloned as described for Hif-4C and designated 
Hif-2h. 

In a further experiment an additional set of 
E. coli clones containing recombinant DNA molecules was 
prepared and colonies hybridizing to the labeled Hif-4c 
fragment were identified. In order to ensure a high 
yield of plasmids with long cDNA inserts, part of the 
double- stranded 32 P-'labeled leukocyte cDNA prepared 
enzymatically from leukocyte poly (A) RNA. (the same cDNA 
preparation as described above) was fractionated by size 
by centrifuging through a sucrose density gradient, using 
the same procedure described for the centrifugation of 
the poly (A) RNA. The fractions containing the cDNA with 
a sedimentation velocity corresponding to a 600 b.p. DNA 



Assayed by cytopathic effect reduction. 



fragment or greater were pooled and the cDNA recovered 
after ethanol precipitation. The cDNA was elongated 
with dCMP residues, hybridized to dGMP-elongated Pst 
I -cleaved pBR322 and the hybrid DNA used to transform 
E. coli as before, except that E. coli HB101 was used. 
The bacteria were distributed onto 8-cm diameter Millipore 
filters, placed on Tryptone medium agar plates (containing 
10 pg/ml tetracycline) and grown until small colonies 
appeared. A replica filter was prepared by pressing a 
fresh, moist Millipore filter onto the colony-bearing 
filter, peeling it off, placing it face upward on an agar 
plate containing 4.4% glycerol and incubating it until 
small colonies appeared. This colony-bearing filter was 
covered with a further Millipore filter, frozen -at -55 °C • 
and stored (D. Hanahan and M. Meselson, "A Protocol For 
High Density Plasmid Screening", Sept. 1978, personal 
communication). Eighteen filters, bearing a total of 
about 5000 colonies were prepared. One replica of each 
filter was used for hybridization to the 32 P-labeled, Pst 
I-excised Hif-4c DNA fragment, exactly as described 
above. About 185 positive colonies were identified on an 
autoradiogram, recloned on Millipore filters and identi- 
fied once more by hybridization. 95 clones giving the 
strongest hybridization response were designated 
Z-pBR322 (Pst)/HcIF-SNl to SN9'5 .and used for further 
investigation. 

It is, of course, evident, given the ability of 
Hif-2h to produce a polypeptide displaying an immunological 
or biological activity of HuIFN (infra), that Hif-2h and 
other DNA sequences related to it, e.g. Hif-4c, may be 
employed in this method of clone screening equally well 
on other clones containing DNA sequences arising from 
recombinant DNA technology, synthesis, natural sources or 
a combination thereof or clones containing DNA sequences 
related to any of the above DNA sequences by mutation, 
including single or multiple, base substitutions, insertions 
inversions, or deletions to select other DNA sequences 
and clones which also code for HuIFN. Therefore, such 
DNA sequences and their identification also fall within 



this invention (e.g., infra). It is also to be understood 
that DNA sequences, which are not screened by the above 
DNA sequences, yet which as a result of their arrangement 
of nucleotides code on expression for the polypeptides 
coded for by the expression of .the above DNA sequences- 
also fall within this invention. 

FURTHER CHARACTER 1 2 AT I ON OF Hif-2h DNA INSERT' 

As described above recombinant DNA molecule 
Hif-2h contains an insert of about 900 b.p., and hybrid- 
izes to human leukocyte interferon mRNA. The following 
additional characteristics were determined. 

1 . Hybrid Arrested Translation 

If mRNA is hybridized to a cloned, complementary 
cDNA, the translation of the mRNA is inhibited, however 
heat denaturatiofc of the hybrid releases translatable 
mRNA (B.M. Paterson et al . , "Structural Gene Identifica- 
tion And Mapping By DNA -mRNA Hybrid-Arrested Cell-Free 
Translation", Proc. Natl. Acad. Sci. USA , 74, pp. 4370-74 
(1977)). 2.2 pg Pst I-cleaved Hif-2h, and as a control 
2 pg Hind i I I-cleaved 2-pBR322 (H3 )/RcpG-4 . 13 ("RcpG" ) were 
denatured in 10 pi 80% (vol/vol) deionized formamide - 20 
mM PIPES buffer (pH 6.4) for 10 min at 80°C. The solution 
was added to an Eppendorf tube into which leukocyte 
poly(A) RNA (5 pg), NaCl (4 p moles) arid EDTA (10 nmoles) 
had been dried down:. The mixture was heated for 7 h at 
48°C under a layer of paraffin oil, cooled and diluted 
with 200 pi H 2 0. The two samples were divided into equal 
parts, and one of each was heated at 100°C for 30 sec. 
The nucleic acids were precipitated with ethanol, dissolved 
in 3 pi H 2 0 and assayed for IFN-ormRNA activity in oocytes 
as above : 



Le poly(A) 
DNA RNA input 

Hif-2h (1*1 pg) 2.5 pg 
Hif-2h (1.1 pg) 



RcpG (1 pg) 
RcpG (1 pg) 

Hif-2h (0.5 pg) 



2.5 pg 

2.5 M g 
2.5 M g 



1 

1 ■ 
1 



pg 
pg 
Mg 



Treatment . 
hybridized 
hybridized and 
denatured 
hybridized 
hybridized and 
denatured 
none 
none 
none 



IFN-q (IU/mD* 
400 

2000 
3000 

3000 
2000 * 
3000 
2000 

Therefore, Hif-2h, when hybridized with poly(A) RNA, 
inhibited the translation of the IFN-amRNA in the poly (A) 
RNA; after denaturing the hybrid, the IFN-amRNA was again 
translatable. This experiment confirms that Hif-2h 
contains sequences complementaxy to IFN-amRNA. 

2. Analysis By Restriction Enzyme Cleavage 
And Determination Of Nucleotide/Amino 
Acid Sequences And Restriction Map 

Digestions of Hif-2h with various restriction 
enzymes (New England Biolab) were carried out, and the 
resulting products analyzed by agarose gel electrophore- 
sis. The underlined fragments are not common to pBR322 
and Hif-2h: 



Restriction enzyme 

PstI 

EcoR I 

Bglll 



Fragment sizes 



Hif-2h** 
885 + 20,4361 
1426 ,3820 
5246 



PBR322 



*** 



4361 
4361 
not cleaved 



* The oocyte medium was assayed after 48 h by the 
cytopathic effect inhibition method. 

** AlthougHv internally consistent herein the absolute 
size of these Nfragments is best indicated by reference to 
Figs 8-10 * 



*** From Sutcliffe ( supra ) . 
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EcoRI + Ball I . 336 , 4960 4361 
EcoRI + PstI 209,676,748 748,3611 
3611 

Bsp l 921 , 587 , 540 , 504 , 587 , 540 , 

— 61 504', 457, 

457,434,2 X 231, 434,267, 

234 

+14 fragments +14 fragments 

200 bp 200 bp 

Mboil 1616 , 884 not done 

and others 



In addition, 5 1 terminally 32 P-labeled PstI cleaved Hif-2h 
was cleaved with several restriction enzymes and the sizes 
of the radioactive fragments derived from the cDNA insert 
15 in that recombinant DNA molecule were determined: 

Restriction enzyme . 3 2 P - fragments * 



20 



EcoRI 


676,209 


Hindi I I * 


no cleavage 


Bspl 


799, 86 


Hpall 


no cleavage 


Hhal 


no cleavage 


BamHI 


no cleavage 


Hinf 


210, 62 


Balll 


545,340 



25 From these data a restriction map of Hif-2h was deduced 
(Figure 4) . jcg £e -- map-4ii j n ol.. tin Phri fi ve in - cogard to tha 
/ ' *afe o o luto - Jx)Cation of tho - ci fees-? — and maybe * incomplete in 

regard to MboII sites within the insert. Only the sites 
closest to the insert are given within the pBR322 moiety. 
30 The arrow indicates the orientation of the IFN-or cDNA coding 
strand. 

Although the actual structure of the Hif-2h 
fragment or other inserts in clones of this invention or 
the amino acid sequence or structure of the polypeptides 
3 5 coded therefrom is not required for one of skill in the 
art to make and use the invention described and claimed 
herein, the above data and restriction map were included 



40 



Although internally consistent herein the absolute 
size of \hese fragments is best indicated by reference to 



Figures 8V10 



in the original application hereto as the best available 
information on the structure of fragment at the time of 
filing the original application. Since that time, as 
expected ( supra , p. 9, lines 27-29), these data and 
restriction map for the Hif-2h fragment have been refined 
using well-known techniques of nucleotide sequencing and 
restriction analysis. E.g. , A. M. Maxam and W. Gilbert, 
11 A New Method For Sequencing DNA 11 , Proc. Natl. Acad. Sci. 
USA , 74, pp. 560-64 (1977). Plasmid DNA was prepared by 
Method B (N. M. Wilkie, et al . , "Hybrid Plasmids Contain- 
ing An Active Thymidine Kinase Gene Of Herpes Simplex 
Virus 1", Nucleic Acids Research , 7, pp. 859-77 (1979)) 
and restricted by various restriction enzymes essentially 
as recommended by the supplier, except that 200 pg/ml gel- 
atin replaced the bovine serum albumin in the enzyme 
buffers. ( EcoRI was a gift from W. Boll, Bsp l a gift from 
A. Kiss and other enzymes were obtained from New England 
Biolabs . ) 

Restricted DNA (20 pg) was extracted with 
phenol, precipitated with ethanol, dissolved in 0.05 M 
Tris-HCl (pH 8), and passed over a small column of 
Chelex-100. Fragments with flush or 5 1 -overhanging ends 
were dephosphorylated by treatment with 0.2 units calf 
intestinal alkaline phosphatase (Boehringer) per pmol 
DNA 5' ends in 200 pi 0.05 M Tris-HCl (pH 8) for 60 min 
at 37° C. The enzyme was inactivated by heating 60 min at 
65°C. For DNA fragments with 3 ! overhanging ends, bacterial 
alkaline phosphatase (Worthington) was used as described 
(A. M. Maxam and W. Gilbert, supra ) except that incubation 
was at 65 °C for 30 min. The dephosphorylated DNA was 
purified by adsorption to and elution from DEAE-cellulose 
(W. Muller et al . , "Site-Directed Mutagenesis In DNA: 
Generation Of Point Mutations In Cloned B-Globin Comple- 
mentary DNA At The Positions Corresponding To Amino 
Acids 121-123", J. Mol. Biol ., 124, pp. 343-58 (1978)) or 
subjected to polyacrylamide gel .electrophoresis where 
required (see below). Fragments recovered from a 
polyacrylamide (or agarose) gel in 0.15 M NaCl, 0.05 M 



Tris-HCl (pH*8) were adsorbed to a 0.1-ml hydroxyapatite 
(Biorad HTP) column, washed 4 times with 1 ml 0.1 M 
potassium phosphate buffer (pH 7) and eluted with 0.3 ml 
1 M potassium phosphate buffer (pH 7). The solution was 
diluted . tenfold and the DNA .adsorbed to DEAE cellulose 
and recovered as described (W. Muller et al . , supra ) . 

After ethariol precipitation, the DNA was 
5 1 -terminally labeled with [y- 32 P] ATP (12-34 pCi per " 
pmol DNA 5 f ends) and polynucleotide kinase (New England 
Biolabs or P-L Biochemicals Inc. ) essentially as described 
by A. M. Maxam and W. Gilbert, supra , except that the DNA 
was not denatured before the kinase reaction. Specific 
activities of 1-1,5 yCi [ 32 P] phosphate per pmol DNA 
5 'ends were obtained. 

For sequencing, labeled fragments were cleaved 
with a second restriction enzyme and the products separa- 
ted by electrophoresis through a 5% poly acryl amide gel in 
tris-borate-EDTA buffer. The desired fragments were' 
extracted from the gel and purified (Muller et al . , 
supra ) . The various fragments for sequencing were pre- 
pared as follows (the number indicates the nominal frag- 
ment chain length in base pairs, the labeled site is 
indicated by an asterisk): 

(1) cleavage of Hif-2h with Bsp l , isolation of 
, Bsp I- Bsp I-232 and Bsp I- Bsp I-949 by 5% poly acryl amide gel 

electrophoresis in Loening's buffer (U. E. Loening "The 
Fraction Of High Molecular Weight Ribonucleic Acid By 
-Polyacrylamide Gel Electrophoreses", J. Biochem . p. 102 
(1967)); 

(2) cleavage of Hif-2h with Bsp l , labeling, 
cleavage with Pst I , isolation of Bsp I*- Pst I-83 and 
BspI*- Pst I-827; 

(3) cleavage of Hif-2h with Bql ll , labeling, 
cleavage with Pst I , isolation of Bgl II*- Pst I-336 and 
Bql II*- Pst I-570; 

(4) cleavage of Hif-2h with Mbo II , labeling, 
digestion with Pst I and Hind i! (to cleave an interfering 



350 bp pBR322 . fragment ) , isolation of Mbol I *-PstI-519 and 
Mbo II*- Pst I-351; 

(5) cleavage of Hif-2h with Eco RI , labeling, 
cleavage with PstI, isolation of EcoRI *-PstI -708 and 
EcoRI *-PstI-198; 

(6) cleavage of Hif-2h with Pst I , labeling, 
cleavage with Bgl ll , isolation of PstI *-Bcjl 1 1-570 and 
Pst I*- Bgl II-336; 

(7) cleavage of Hif-2h with Ava il , labeling, 
cleavage with Pst I and Bgl ll , isolation of Aval I* -PstI -186 
and AvaII*-Bgl II-147; 

(8) cleavage of Hif-2h with PvuII, labeling, 
cleavage with Pst I and Bgl ll , isolation of PvuII* -PstI -486 . 

• The .fragments were degraded according to the • 
method of A. M. Maxam and W. Gilbert, supra , with the 
modifications described in protocols provided by the same 
authors in September 1978. The products were fractionated 
on 0.1 x 25 x 36 cm 12% polyacrylamide gels (acrylamide/ 
bisacrylamide = 18/1) in 50 mM tris-borate, 1 mM EDTA 
(pH 8.3), with runs of 2, 8, 18 and 26 h at 900 V following 
a 6 h prerun at 700 V. Best results were obtained when 
the gels were kept at room temperature 2-3 days before 
use . 

Each stretch of the cDNA insert was sequenced 
from both strands and each restriction site which served 
as labeled terminus was sequenced using a fragment spanning 
it. The nucleotide sequence thus obtained is depicted in 
Figures 8-10. As is to be expected the positions of the 
various restriction sites in this insert are more absolutely 
located than those determined by restriction enzyme 
cleavage alone and depicted in Fig. 4. 

Referring now to Figs. 8-10, the heteropolymeric 
part of the. insert is flanked by 23G residues at the 
5 ! end and by. 7 A residues (probably reflecting the poly(A) 
terminus of the mRNA) followed by 15C residues at the 
3V terminus. For reference, the. insert is numbered from 
the first nucleotide following the dG tail to the last 
nucleotide before the polyA residues. An ATG initiation 



triplet in position 57-59 and a TAA termination triplet 
at position 624-626 define a reading frame uninterrupted 
by nonsense codons. Both other reading frames in this 
region of the insert contain 18 and 12 nonsense codons 
respectively. Moreover, the only other sequences, i.e. ,, 
in different reading frames, flanked by an ATG or GTG and 
a termination signal, which code for a polypeptide of ^ 
25 amino acids or 'more, are located between nucleotides 226 
and 304, 640 and 778 and 683 and 743, respectively. 
Therefore, the region between nucleotides 57 and 626 most 
likely includes the nucleotide sequence of the Hif-2h 
fragment that codes for a polypeptide displaying a biolog- 
ical or immunological activity of IFN-or in accordance 
with this invention. 

It should of course be understood that cloned 
cDNA from polyA RNA by the usual procedures /(A. Efstratiadis 
et al . , supra ) lacks 5 1 terminal nucleotides and may even 
contain artif actual sequences (R. I- Richards et al . , 
"Molecular Cloning And Sequence Analysis Of Adult Chicken 
B-Globin cDNA" , Nucleic Acids Research , 7, pp. 1137-46 
(1979)). Therefore, it is not certain that the ATG 
located at nucleotides 57-59 is in fact the first ATG of 
authentic mRNA . However, for the purposes of the follow- 
ing description, it Is assumed that the ATG at nucleo- 
tides 57-59 is the first ATG of authentic mRNA. 

By comparing the polypeptide coded by this 
region of the insert with that sequence of 35 amino 
" terminal amino acids of authentic human lymphoblastoid 
interferon — SerAspLeuProGlnThrHisSerLeuGlyAsnArgArgAla- 
LeuIleLeuLexLAlaGlnMetGlyArglleSerLeuPheSerCysLeuLys- 
AspArgHisAsp — determined by K. C. Zoon et al . , supra and 
M. Hunkapiller and L. Hood, supra , it appears that the 
chosen reading frame is correct and that nucleotides 57-124 
may code for a signal sequence which precedes the nucleotide 
sequence coding for the "mature" polypeptide because 
alignment of the published- sequence with the determined 
sequence (from the 24th amino acid onward) displays 
extensive coincidence ( i.e. , 26 of 35 amino acids). 



In eukaryotic mKNAs the first AUG triplet from 
the 5 1 terminus is usually the initiation site for protein 
synthesis (M. Kozak, "How Do Eukaryotic Ribosomes Select 
Initiation Regions In Messenger RNA", Cell/ 15, pp. 
5 1109-25 (1978)). The codon in the Hif-2h fragment corres- 
ponding to the first amino acid of lymphoblastoid inter- 
feron is 22 codons from the first AUG (and 14 codons from 
the second one) indicating that the sequence coding for 
interferon may be preceded by a sequence dete rmini ng a 

10 signal peptide of 23 (or less likely 15) amino acids. 
The longer of the presumptive signal sequences contains 
an uninterrupted series of 11 hydrophobic amino acids 
(and the shorter one, one" of 6 hydrophobic amino acids). 
This accumulation of hydrophobic residues is characteristic 

15 of signal sequences ( cf . , B. D. Davis and P. C. Tai, "The 
Mechanism Of Protein Secretion Across Membranes", Nature , 
283, pp. 433-38 (1980)) 

The nucleotide sequence apparently corresponding 
to "mature" IFN-a polypeptide comprises 498 nucleotides, 

20 which code for 166 amino acids. Assuming that there is 
no carboxyterminal processing, . the ^aolecular weight of 
the interferon polypeptide is 4r£*3SS-. The base composi- 
tion of the coding sequence is 50% GC. The codon usage 
within the interferon coding sequence is in reasonable 

2 5 agreement with that compiled for mammalian mRNAs in 

general (R. Grantham, et al . , "Codon Catalog Usage And 
The Genome Hypothesis" , Nucleic Acids Research , 8, 
pp. 49-62 (1980)). Any deviations observed may be ascribed 
to the small numbers involved. 
30 The structure of the polypeptide depicted in 

Figure 8-10 for the Hif-2h fragment, of course, does not 
take into account any modifications to the polypeptide 
caused by its interaction with in vivo enzymes, e.g. , 
glycosolatioh. Therefore, it must be understood that 

3 5 this structure may not be identical with IFN-a produced 

in vivo , but it still has very similar, if not identical, 
biological and immunological properties. Neither, does 
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this structure exclude the likelihood that other modifi- 
cations such as mutations, including single or multiple, 
base substitutions, deletions, insertions, or inversions 
or chemical derivatizations of this structure will not 
produce compounds that also display I FN -a activity. 

3 . Determination Of The Plus Strand 
Of The Inserted IFN-crcDNA 

The DNA strand that has the same sequence as 
the mRNA is designated as plus strand, and its complement 
as minus strand. The plus strand of the IFN-acDNA insert 
uas, identified as outlined. in Figure 5. Hif-2h DNA was. 
cleaved with the restriction enzyme Bql ll , the termini 
labeled with 3 2 P-phosphate (as described above for Pstl- 
cleaved termini) and the DNA digested with PstI , to- give 
longer (545 b.p. (570 b.p. as determined in the more 
refined analysis reported above)) and shorter 340 bp 
(336 bp as determined in the more refined analysis 
reported above)) radioactive fragments. These fragments 
were denatured and hybridized to poly (A) RNA from induced 
leukocytes in 80% formamide, 0.4 M NaCl, i.e. , under 
conditions where DNA- DNA reassociation does not occur 
( supra ) . The nucleic acids were digested with nuclease 
Si, which degrades all single-stranded nucleic acids, in 
particular the non-hybridized 32 P-DNA, and the products 
-were separated on a polyacrylamide gel (R. F. Weaver and 
C. Weissmann, "Mapping Of . RNA By A Modification Of The 
Berk-Sharp Procedure 1 ' / Nucleic Acid Research , 7, pp. 1175-93 
(1979 )) . An autoradiogram showed that only the shorter 
-nucleotide fragment had been hybridized and protected by 
the poly(A) RNA, identifying the 5' -labeled shorter- 
nucleotide strand as the minus strand. The orientation 
of the plus strand is therefore as given in Figure 4 and 
Figure 5 (right hand side)'.. 



'4. Demonstration That Poly(A) RNA From 
Non- Induced Human Leukocytes Does 
Not Hybridize To Hif-2h DNA 



An experiment identical to that described in 
the preceding section was carried out, however the poly(A) 
RNA was from non-induced human leukocytes, prepared by 
the same procedure as in the case of Sendai virus-induced 
leukocytes. No detectable amount of. labeled DNA was 
protected. By comparison to the results of the preceding 
section the poly(A) RNA from non-induced cells contains 
less than about 1/20 the * amount of mRNA hybridizable to 
Hif-2h than does poly(A) RNA from induced cells. 

SYNTHESIS OF A POLYPEPTIDE WITH INTERFERON ACTIVITY 
BY E. COLI CONTAINING RECOMBINANT DNA MOLECULES 
RELATED TO Z-pBR322,( Pst )/HcIF-4c 

The Pst I site of pBR322 lies within the p-lacta- 
mase (penicillinase) gene. Therefore, when a coding DNA 
segment ( e.g. , a cDNA comprising all or part of a gene) 
is ligated into the position in the proper orientation 
and proper reading frame, a fused protein may result. 
The protein would consist of the amino-terminal portion 
of p-lactamase followed by the amino acid sequence for 
which the inserted DNA sequence codes (L. Villa-Komarof f 
et al . , supra ) . If the inserted DNA segment comprises a 
DNA sequence containing its own initiation signal, and 
has a sequence preceding it with a termination signal in 
phase with the p-lactamase sequence, termination and 
re-initiation may occur at the second initiation signal 
and a non-fused, active protein may result (A.C.Y. Chang 
et al . , supra ) . To ensure that the DNA insert related to 
Hif-4c was inserted in the proper reading frame for 
expression within the p-lactamase gene, a set of deriva- 
tives of pBR322, namely pKT279, pKT280 and pKT287 (con- 
structed by K. Talmadge, personal communication, 1979) 
was employed. Each of these derivatives has a Pst I site 
located such that a DNA insert ligated into that site 
will be. in a different reading frame from an insert at 



the Pst I site of the other derivatives of the set 
(Figure 6). Therefore, the set permits the insertion of 
DNA into the p -lactamase gene in all three reading frames. 
The PstI -excised insert from Hif-2h was prepared as 
5 described for the fragment Hif-4c. The Hif-2h Pst frag- 
ment (10 ng) was mixed with PstI -cleaved pBR322 , pKT279, 
pKT280 or pKT287 (10 ng in each case) in 20 pi of 10 mM 
Tris-HCl (pH 7.5), 6 mM MgCl 2 / 100 mM NaCl, 6 mM p-mercap- 
toethanol, 200 pg/ml gelatin and 0.1 mM ATP and incubated 

10 with 0.1 units T 4 DNA ligase (New England Biolabs) for 
16 h at 10°C. The resulting recombinant DNA molecules 
are designated Z-pBR322(Pst)/HcIF-2h, Z-pKT279(Pst)/HcIF-2h, 
Z-pKT280(Pst)/HcIF-2h and Z-pKT287(Pst)/HcIF-2h. E. coli 
HB101 was transformed with each of these recombinant DNA 

15 molecules and transformed colonies were selected on 

tetracycline-containing agar plates as described previously. 
Since tetracycline-resistant clones of transformed bacteria 
may also contain the recyclized vector, bacterial colonies 
of each set were grown on Millipore filters and colonies 

2 0 hybridizing to 32 P-labeled Hif-4c fragment were identified 
and selected as described above. These strains were 
designated as follows, 



E. 


coli 


HB101 


( Z-pBR322 (Pst )/HcIF-2h-AHl ) 


to 


(-AH3); 


E. 


coli 


HB101 


( Z-pKT2 7 9 ( P st ) /He I F-2h-AHl ) 


to 


(-AH8); 


25 E. 


coli 


HB101 


(Z-pKT280 (Pst )/HcIF-2h-AHl ) 


to 


(-AH8); 


E. 


coli 


HB101 


( Z-pKT287 (Pst)/HcIF-2h-AHl ) 


to 


( -AH8 ) . 



Extracts of some of the above strains as well 
as of some of the strains Z-pBR322 (Pst )/HcIF-SNl to 95 
were tested for IFN-o activity. Bacteria were grown in 

30 Tryptone medium to stationary phase, harvested, washed 
'with 1/20 vol (based on the vol of the culture) 50 mM 
Tris-HCl (pH 8), 30 mM NaCl and frozen. After thawing, 
the cells were resuspended in the volume indicated below 
of the previous buffer and lysozyme was added to 1 mg/ml. 

3 5 After 60 min at 0°C the suspensions were frozen (in an 
ethanol-dry ice bath) and tbawed (at 37°C) 5 times, and 

0 centrifuged 10 min at ^ffloOO rpm in a GSA Sorvall rotor. 
In some cases part of the supernatant (S30) was further 



160.66b ... 
centrifuged at d - 00 'OOO xg in a Type 65 Spmco rotor and 

the supernataifts (S100) recovered. Such supernatants 

were screened for IFN-a activity by the cytopathic effect 

reduction assay (Expt.l). The colonies showing a positive 

response in Expt. 1 were reassayed as well as 49 clones 

from the set Z-pBR322/HcIF-SN-l to SN-95 described above 

at a lower dilution (Expt* 2). 



Source of extract; 

E. coli HB 101 transformed by: 

Expt. 1* 



Z-pBR322 (Pst )/HcIF-2h 
Z-pBR322(Pst)/HdF-2h-AH-l to 3 
Z-pKT279(Pst)/HcIF-2h-AH-l to 7 
Z-pKT2 7 9 ( P s t y/Ec IF-2h-AH-8 
Z-pKT280 (Pst )/HcIF-2h-AH-2 ,6,7 
Z-pKT280 (Pst )/HcIF-2h-AH-l ,3,4,5 
Z-pKT287 (Pst )/HcIF-2h-AH-l ,2,3,4,5,8 
Z-pKT287 (Pst )/HcIF-2h-AH-6 , 7 
Source of extract; 
E. coli HB101 transformed by: 
Expt. 2** 

Z-pKT279/HcIF-2h-AH-8 
Z-pKT280/HcIF-2h-AH-l ,3,4,5 
Z-pKT287/HcIF-2h-AH-6 and 7 



Preparation 

S30 
S30 
S30 
S30 
S30 
S30 
S30 



IFN-a activity 
( IU/ml ) 



S30 



Preparation 
S30 and S100 
S30 and S100 
S30 and S100 



Z-pBR322 (Pst )/HcIF-SN-4 ,5,7,9, 

10,11,13, 
to 16 



S30 



Z-pBR322(Pst)/HcIF-SN-18 to 22, 

24,25,27, 
30 to 34 S30 

Z-pBR322(Pst)/HcIF-SN-38 to 41, 

43 to 48 S30 



7 



positive * 
? 

positive 
7 

positive 

IFN-a activity 
(IU/ml) 
300 
300 
300 



neg (<10) 

neg (<10) 
neg (<10) 



Expt. 
Expt. 



1: Extracts assayed' at 1:150 final dilution 
2: Extracts assayed at 1:6 final dilution 



Z-pBR322(Pst)/HcIF-SN-l to 3,6, 

8,12,17, 
23,26 S30 

Z-pBR322(Pst)/HcIF-SN-28, 29,36, 

37,49 S30 



Z-pBR322 (Pst )/HcIF-SN-35 , 42 



S30 



10 

10 
200 



Some of the more active producers from above were examined 
in more detail. Cultures were grown to late log phase 
(apparent OD 650 about 0.9)* and the cells lysed as above, 
in 1/50 of the culture volume. The following activities 
were found, using Z-pBR322 (Pst )/HcIF-SN32 as a negative 
control : 



Source of extract: 

E. coli HB101 transformed by: 

Z-pKT279(Pst)/HcIF-2h-AH8 
Z-pKT2 8 0 ( Ps t ) /He I F-2h-AH3 
Z-pKT287(Pst)/HcIF-2h-AH6 
Z-pBR322 (Pst )/HcIF-SN35 
Z-pBR322(Pst)/HcIF-SN42 
Z-pBR322 (Pst)/HcIF-SN32 



Preparation 

S30, S100 

S30, S100 

S30, S100 

S30, S100 

S30, S100 

S30, S100 



I FN -a . activity 9 * 
( IU/ml ) 
(dup. assays) 

100; 300 

1000; 1000 

200; 200 

1000; 1000 

300; 100 

0; 0 



It is to be understood that the actual protein produced 
by these strains has not been analyzed structurally to 
determine whether or hot it is produced fused to amino 
acids unrelated to I FN or with all or part of IFN ! s 
signal sequence. However, whatever protein is produced, 
it displays an immunological or biological activity of 
I FN. Therefore, the protein as expressed is useful. 
Most importantly, the activity of the protein demonstrates 



* Three thousand liters or larger cultures of IFN-ot 
can be grown without loss of I FN activity. 

** It is to be understood that the above reported 
expression may reflect interferon production by genes 
under the control of the penicillinase expression control 
sequence . 



that the dma i»...|i,r-t.f.-f. whl.-h ciiiJuu lor it is a DNA sequence 
related to iiu J FN-i* . J t theieiore is within the skill of 
the art to empjoy th.it . DNA sequence as is demonstrated 
herein to select other like HuIFW-t* related DNA sequences 
and to provide the has its for other constructions that 
will express mature interferon or other variants thereof 
or will improve the yield of the particular protein 
expressed. 

The level of production of a protein is governed 
by two major factors: the number of copies of its gene 
within the cell and the efficiency with which those gene 
copies are transcribed and translated. Efficiency of 
transcription and translation (which together comprise 
expression) is in turn dependent upon nucleotide sequences, 
normally situated ahead of the desired coding sequence. 
These nucleotide sequences or expression .control sequences 
define, inter alia , the location at which RNA polymerase 
interacts to initiate transcription (the promoter sequence) 
and at which ribosomes bind and interact with the mRNA 
(the product of transcription) to initiate translation. 
Not all such expression control sequences function with 
equal efficiency. It is thus of advantage to separate 
the specific coding sequences for the desired protein 
from their adjacent nucleotide sequences* and fuse them 
instead to other known expression control sequences so as 
to favor higher levels of expression. This having been 
achieved, the newly engineered DNA fragment may be inserted 
into a multicopy plasmid or a bacteriophage* derivative in 
order to increase the number of gene copies within the 
cell and thereby further to improve the yield of expressed 
protein. 

Several expression control sequences may be 
employed as. described above. These include the operator, 
promoter and ribosome binding and interaction sequences 
(including sequences such as the Shine-Dalgarno sequences) 
of the. lactose operon of E. coir ("the lac system" ) , the 
corresponding sequences of the tryptophan synthetase 
system of E. coli ("the trp system"), the major operator 



and promoter regions of phage A an< ^ °R P R^' ^ e 

control region of the phage fd coat protein, or other 
sequences which control the expression of genes of pro- 
karyotic or eukaryotic cells and their viruses. There- 
fore, to improve the production. of a particular polypep- 
tide in an appropriate host, the gene coding for that 
polypeptide may be prepared as before and removed from a. 
recombinant DNA molecule containing it and reinserted 
into a recombinant DNA molecule closer to its former 
expression control sequence or under the control of one 
of the above expression control sequences. Such methods 
are known in the art. 

Further increases in the cellular yield of the 
desired products depend upon an increase in the number of * 
genes that can be utilized in the cell. This is achieved, 
for illustration purposes, by insertion of recombinant 
DNA molecules engineered in the way described previously 
into the temperate bacteriophage \ (NM989), most simply 
by digestion of the plasmid with a restriction enzyme, to 
give a linear molecule which is then mixed with a re- 
stricted phage k cloning vehicle ( e.g. , of the type 
described by N. E. Murray et al . , "Lambdoid Phages That 
Simplify The Recovery Of In Vitro Recombinants", Molec. 
gen. Genet. 150, pp. 53-61 (1977) and N. E. Murray et al . , 
"Molecular Cloning Of The DNA Ligase Gene From Bacterio- 
phage T4", J. Mol. Biol. , 132, pp. 493-505 (1979)) and 
the recombinant DNA molecule produced by incubation with 
DNA ligase. The desired recombinant phage is then se- 
lected as before and. used to lysogenise a host strain of 
E. coli . 

Particularly useful A cloning vehicles contain 
a temperature-sensitive mutation in the repression gene 
cl and suppressible mutations in gene S, the product of 
which is necessary for lysis. of the host cell, and gene 
E, the product which is the major capsid protein of the 
virus. With this system the -lysogenic cells are grown at 
32°C and then heated to 45°C to induce excision of the 
prophage. Prolonged growth at 37°C leads to high levels 



of production of the protein, which is retained within 
the cells, since these are not lysed by phage gene products 
in the normal way, and since the phage gene insert is not 
encapsidated it remains available for further transcrip- 
tion. Artificial lysis of the cells then releases the 
desired product in high yield. 

In an initial attempt to increase the yield of 
polypeptide, displaying a biological or immunological 
activity of human leukocyte interferon, produced from 
hosts transformed with Z-pBR322 (Pst)/HcIF-SN35 by the 
processes above described, a restriction map of the DNA 
insert in the hybrid was determined. This mapping revealed 
that as compared to Hif-2h, Hif-SN35 was lacking a Pst I 
site flanking the 3 f end of the sequence, part of the 
signal sequence was missing (up to and including codon 7) 
and the Ava il site in the signal sequence had been replaced 
by a Bsp l site. Therefore, Hif-SN35 is likely a polymorphic 
or allelic varient of Hif-2h. 

The plasmid Hif-SN35 was opened with Pst I and 
the resulting DNA strand chewed back at both ends using 
standard procedures and the LAC-Alu fragment ( infra ) 
inserted therein and the plasmid reclosed. The actual 
structure of the modified plasmid, identified as 
Z-pBR322(Pst)/HcIF-SN35-AHL6, and the amino acid sequence 
at the amino terminal end of the protein produced in 
E. coli have been determined. The nucleotide sequence of 
this construction reveals that the LAC- Alu fragment was 
attached one amino acid away from the first amino acid of 
IFN-al(SN35 > . The amino acid sequence of the amino terminal 
portion of the protein expressed in E. coli revealed that 
a fused protein was produced having six amino acids fused 
to the IFN-al(SN35) sequence. However, hosts transformed 
with the modified plasmid produce about 100 times more 
polypeptide displaying a biological or immunological 
activity of human leukocyte interferon as compared to 
hosts transformed with unmodified. Z-pBR322 (Pst )/HcIF-SN35 . 

Referring now to Figure 25, another attempt to 
improve the yield of polypeptide, displaying an immunologi- 



cal or biological activity of human leukocyte interferon, 
produced by hosts transformed with 2-pBR322 (Pst )/HcIF-SN25 
(SN35 in Figure 25) is depicted. The hybrid plasmid was 
cleaved using standard procedures with Bsp l (a gift of 
Dr. Kiss). After heat inactivation (65°C, 30 min) of the . 
restriction enzyme, the mixture was adjusted to 50 mM 
Tris-HCl (pH 8) and heated (37°C / 30 min). Following 
extraction with phenol and ether, the largest, orlcDNA 
fragment was isolated on low temperature gelling agarose 
(0.8%) and Hind i I I linkers attached. The modified fragment 
was then joined to Hind i I I -cleaved plasmid HS-pBR322 (Eco )/ 
lacUV5-150 ("LAC-150" )* { a gift of H. Schaller) by 
melting the fragment-containing gel pieces (about 20 pi 
each) at 65°C, cooling to 37°C and adding 20, U per pi 
T4 DNA ligase. After 16 h at 15 °C, ligation occurred in 
the solidified gel (H. Lehrach, personal communication 
1980). * One tenth vol 100 mM Tris-Hcl (pH 7.5), 100 mM 
CaCl 2 , 100 mM MgCl 2 were added to the sample and it was 
heated 5 min at 65°C and cooled to 37°C. The samples 
were then used to transform Ca +2 treated E. coli HB101, 
incubated at 0°C for 20 min, heated at 42°C for 1 min and 
for 10 min at 20°C. After addition of 1 mol tryptone 
medium, the samples were incubated 60 min at 37 °C and 
plated on to agar plates containing ampicillin. Plasmid 
DNA was separated from these cultures, as before, and the 
hybrid plasmid containing the IFN-al insert with its 5' 
end adjoining the LAC fragment identified by restriction 
analysis. The plasmid was then cleaved with EcoRI using 
conventional procedures and digested with exonuclease 
BAL-31 (0.06 U/ml, 2-4 min at 30° C) to remove the over- 
hanging. Eco RI tail of the LAC fragment and to shorten the 
p-galactosidase coding segment. 



* This plasmid contains the lac promoter Hael 11-202 bp 
fragment (W. Gilbert et al . , "Lactose Operator Sequences 
And The Action Of Lac Repressor" in Protein Liqand 
Interactions , H. Sund and G. Blauer, eds. (Berlin, Walter 
de Gruyter), pp. 193-206 and K. Backman et al . , "Maximizing 
Gene Expression On A Plasmid Using Recombination In Vitro", 
Cell , 13, pp. 65-71 (1978)) flanked by an EcoRI linker at 
its 3 1 end. 



To ensure that the treated plasmid contained 
the complete IFN-al coding sequence, the plasmid was then 
cleaved with Bgl l I , using conventional procedures, worked 
up as before and the largest fragment separated on agarose 
gel (0.8%). This fragment was then combined with a 
Bspl- Bgl ll fragment from Z-pBR322 (Pst )/HcIF-SN35 and the 
resulting hybrid plasmid used to transform £. coli HB101 
as before. The transformed colonies were screened for- 
I FN activity and one clone having a high level of I FN 
activity was selected. This clone was designated E. coli 
HB101 (C8-IFN-al) and its hybrid plasmid C8-IFN-al. 

DNA sequence analysis of C8-IFN-al revealed 
that the coding sequence following the initiator triplet 
determined the first seven amino . acids of p-galactosidase, ; 
a Pro residue generated by the fusion, amino acids 16 to 
23 of the IFN-al signal sequence and the IFN-al (SN35) 
sequence. £. coli minicell strains (DS410 )transformed 
with hybrid plasmid C8-IFN-al produce about 50 million 
units I FN per liter. or about 2500 times more polypeptide 
displaying an immunological or biological activity of 
HuIFN as compared to minicells transformed with unmodified 
2-pBR322(Pst)/Hif-SN35. Amino acid sequencing of the 
polypeptide produced by plasmid C8-IFN-al confirms that 
the product is a fused protein having sev^n amino acids 
from p-galactosidase , one amino acid generated from the 
fusion and amino acids 16-23 of the IFN-al signal sequence 
fused to IFN-al. 

Further, examples of various constructions to 
improve the protein yields in accordance with this invention 
are discussed in connection with other forms of IFN-a 
(infra)'. 



PROPERTIES OF INTERFERON ACTIVITY PRODUCED BY 
E. COLI TRANSFORMED WITH HYBRID PLASMIDS 

1 . Sensitivity Of IFN-a Activity To Trypsin 



50 pi samples of authentic Hu IFN-a (specific 
activity, 1.2 x 10 6 U/mg; SOU) , and the S100 extracts 
described above of E. coli HB101 (Z-pKT287(Pst )/HcIF-2h-AH6 ) 
("Hif-287-6 extracts") (200 U/ml, 10 U) and of E. coli . " 
HB101 (2-pBR322(Pst)/HcIF-SN35) ( "Hif-35 extracts") 
(1000 U/ml; 50 U) were incubated with various amounts of 
trypsin, as indicated below, for 30 min at 37°C. Since 
the S100 extracts have a high protein content, while the. 
Hu IFN-a does not, a mixture of HuIFN-or and the control 
S100 extract Hif-32 was tested in parallel: 



IFN-a activity 



IFN-or preparation Trypsin (pq) (units) 



HuIFN-a (50 units in 50 |jl 

Hif-32 S100 extract) 0 50 

0.1 50 

1 50 

10 5 

50 0 

Hif-287-6 S100 extract (10 units) 0 15 

0.1 15 

1 5 

10 1 

50 0 



Therefore the IFN-a of the extracts is sensitive 
to trypsin and hence a protein. 

Hif-35 S100 extract (50 units) 0 

0.1 

1 
10 
50 



2 . Behavior On Chromatography On Sephadex G-100 



Extract Hif-35 (1 ml) and the S100 extract of 
E. coli HB101 (Z-pBR322(Pst)/HcIF-SN32) ( u Hif-32 extracts") 
were chromatographed on a 32-ml Sephadex G-100 column at 
4°C in 50mM K-phosphate buffer (pH.7.4). Cytochrome c 



30 
20 
20 
2 
0 



(0.2 mg) was added as an internal marker. The flow rate 
was 2 ml/hr and 1.0 ml fractions were collected. The 
absorbance at 280 nm, and 405 nm (cytochrome c), and the 
I FN- a activity were determined. As shown in Figure 7, the 
IFN-cr activity of Hif-35 extracts was eluted before cyto- 
chrome c, with a value of about 0.45* Therefore, the 
apparent i^l eCTilar^^ght of the substance was between 
about* ' fflfoooo cma :ftK08 6*7 no activity was detected in the 
fractions of control extract Hif-32. 

3. Inhibition Of The Interferon Activity 
Of Hif-35 And Hif-287-6 By Antibody 
Against Human Leukocyte Interferon. 

HuIFN-cr (specific activity 1.2 x 10 s IU/mg), 
and the Hif-35/Hif-287-6 S100 extracts were incubated * 
with various dilutions of sheep antiserum against Hu IFN-cr 
^Djrep. -X. Cantell, Feb. 24, 1976, specific activity 
4$8*li?$0 > units/ml ) in 100 |jl Modified Eagles Medium (MEM) 
'with 10% calf serum for 30 min at 37°C and 45 \xl were 
assayed for IFN-a activity by the cytopathic effect reduc- 
tion assay. (The antibody itself did not cause a cyto- 



pathic effect): 

Anti - 1 eukocyte- Res idual 
IFN-a Preparation interferon antibody IFN-a activity 
(units) (units) (HQ 



IFN-a 0 5 

(10) 0.18 -0.5 

9 <0.1 

450 <0.1 

Hif-35 extract 0 15 

(25) 0.18 15 

9 <0.1 

450 <0.1 

Hif-287-6 extract 0 15 

(25) 0.18 15 

9 <0.1 

450 <0.1 

none 0 < 0 . 1 

450 <0.1 



* The molecular weight determined by nucleotide sequenc- 
ing and assuming no carboxyterminal processing is 19,388. 



To show that the action of the antibody was not due to an 
unspecific effect, such as proteolytic degradation, a 
similar experiment was performed with the mouse interferon 
system: 

Anti -leukocyte- I FN activity 

I FN Preparation interferon antibody (units/ml) 

(Units ) (units/ml ) mouse system 

mouse preparation 4500 100 

(100 units) 90 100 

18 100 

Thus, antibodies directed against HuIFN-cr specifically 
inhibit the I FN -a activity of polypeptides produced in 
E. coli transformed with certain recombinant DNA molecules 
containing the HcIF-2h DNA sequence. The apparently 
lower affinity of the antibody for the I FN-a . produced in 
E . coli may reflect structural differences between the 
latter* and natural HuIFN-a, for example, absence of 
carbohydrate moiety, presence of signal sequence, or 
fusion to part of the p -lactamase sequence. 

4. Reduced Activity Of Hif-35 And 

Hif-287-6 Extracts On Mouse Cells 

Human CCL23 cells or Mouse L929 cells were 
treated with E. coli extracts, HuIFN-a (prep., K. Cantell, 
specific activity 1.2 x 10 6 units/mg) or mouse IF (N.I.H. 
standard), were challenged with virus (Mengo virus in the 
case of human cells and VSV in the case of mouse cells) 
and the IFN-o activity determined by the cytopathic 
effect reduction assay: 

I FN activity (units/ml) 
Addition human system mouse system 

mouse - interferon (120 units/ml) - 120 
Hif-35 extracts 100 13 

1000 120 

Hif-287-6 extracts 30 4 , 

300 40 

HuIFN-a (100 units/ml) *. 100 4' 

HuIFN-a (1000 units/ml) 1000 40 



These results show thatHif-35 and Hif-287-6 extracts 
have a protective action on human cells and only a slight 
effect (-10%) on mouse cells, as is typical for human 
interferon. 

5. Effect On Some Cell Function 

Extracts from E. coli HB101 ( Z-pBR322 (Pst )/Hif- 
SN35-AHL6) were compared with authentic I FN for its 
effect on- some cell functions. The E. coli . made IFN-a 
displayed the following properties of natural IFN-a: 
(1) it enhances natural killing activity of human lympho- 
cytes; (2) it enhances antibody-dependent cell-mediated 
cytotoxicity; (3) it suppresses antigen-, and mitogen-, 
induced leukocyte migration inhibition; and (4) it inhibits 
growth of IFN-sensitive Burkitt lymphoma cells.. These 
properties are indicia of E. coli synthesized IFN-al ! s . 
activity against human tumors and cancers. 

6. Activity Of IFN-a Without 
• Amino-Terminal Sequences 

IFN-a without amino-terminal sequences has also 
been made in E. coli and shown to display activity consis- 
tent with I FN activity. 

To construct the appropriate recombinant DNA 
molecule, plasmid Hif-2h ( supra ) was partially digested 
with EcoRI and BamH I and the fragment containing the 
IFN-a 1 coding sequence separated on agarose gel and 
combined with the non- IFN-a 1 coding sequence obtained 
from an Eco RI/ BamH I restriction of plasmid Hif-SN35 which 
is missing a PstI site adjacent the 3 ! end of the hybrid 
insert as compared to Hif-2h (supra). The resulting 
plasmid was then restricted with Fst l/ Bgl l to remove a 
portion of the amino terminal part of the IFN-al coding 
sequence. Inserted in its place were a series of IFN-al 
fragments prepared by digestion of plasmid Hif-2h with 
PvuII, treatment with Bal exonuclease, attachment of PstI 



linkers and restriction with Bal l I - The resulting plasmids 
thus* contained a series of IFN-al coding sequences which 
lacked various portions of their amino terminal sequences. 
One of these (plasmid 2H-M8 ) was digested with Pst I and 
its nucleotide sequence determined. Sequencing revealed 
that the plasmid 2H-M8 contained several nucleotides 
between the Pst site and the first codon (CYS) of IFNral. 
Therefore, the Pst I digested plasmid 2H-M8 was treated 
with T4 polynuclease/dATP, SI exonuclease and digested 
with Sai l . This procedure generated a series of fragments 
whose IFN-al coding sequences were missing portions of 
their amino terminal end. These fragments were then 
placed under LAC control by operatively linking them to a 
GUA-LAC fragment prepared from* plasmid Lac3V8 by digestion 
with Eco RI , treatment with exonuclease Si and digestion 
with Sai l . The resulting series of plasmids thus had 
IFN-al coding sequences lacking various portions of their 
amino terminal ends attached in a counterclockwise direc- 
tion via an AUG to a fragment containing the LAC promoter. 

Some of these plasmids were sequenced. One 
began at the fifth amino acid of IFN-al and one at the 
tenth amino acid of IFN-al. In E. coli minicells (DS410) 
both of these plasmids produced polypeptides that displayed 
I FN activity. Therefore, not all of the IFN-al protein 
is required for I FN activity. " 

IDENTIFICATION OF CLONES OF E. COLI CONTAINING 

RECOMBINANT DNA MOLECULES WHOSE DNA INSERTS 
WEAKLY CROSS-HYBRIDIZE TO THE INSERT IN Hif-4c 
AND HAVE A DIFFERENT RESTRICTION MAP THAN THE 
Hif-2h FRAGMENT 



The comparison of the first 35 amino acids of 
authentic lymphoblastoid interferon (Zoon et al . , supra , 
and M. Hunkapiller and L. Hood, supra ) and the sequence 
deduced from Hif-2h fragment shows 9 differences. In all 
cases, the codons for the differing amino acids could be 
related by or>e-base changes. The amino acid compositions 
determined directly for authentic lymphoblastoid interferon 



on the one hand and deduced from the sequence of the 
Hif-2h fragment on the other, also show striking ' differ- 
ences in regard to their content of Gly, Pro, Cys and 
Met. These differences are too large to be explained by 
polymorphism. Instead, they most likely reflect the 
existence of at least two non-allelic genes, because the 
degree of divergence of the two proteins (26% mismatch) 
is similar to that between, for example, human and sheep 
B-globin (23% mismatch). Accordingly, the clones that 
displayed weak hybridization to fragment Hif-4c, identified 
previously ( supra ) , were examined and a clone E.coli HB101 
(Z-pBR322(Pst)/HcIF-II-206) was identified. 

The hybrid plasmid Z-pBR322 (Pst)/HcIF-I 1-206 
( "HcIF-II-206" ) , of . this clone and its DNA insert 
"Hif-II-206 fragment" are weakly hybridizing to Hif-4c 
and Hif-2h fragment. coli transformed with plasmid 

Hif-II-206 produces polypeptides displaying a biological 
or immunological activity of HuIFN-a . Hif-II-206 fragment 
has a restriction map that is distinct from that determined 
for the Hif-2h fragment. A comparison of the two restric- 
tion maps is set forth in Fig. 11. Again, the absolute 
location of the restriction sites in the Hif-II-206 
fragment is not determined by restriction mapping alone. 
However, sequencing of the nucleotide sequence of this 
insert, using the standard procedures described above, 
permits more absolute determination of these locations. 
However, because of the differences in the restriction 
map of the Hif-II-206 fragment as compared to the Hif-2h 
fragment, it is clear that the interferon genes of the 
two inserts have different nucleotide sequences.^ 

Referring now to Figures 12-16, the nucleotide 
sequences of the inserted DNA sequence — Hif-II-206 
fragment — of culture HcIF-G and the inserted DNA 
sequence — Hif-2h fragment — of culture HcIF-E (infra), 
determined previously, and the corresponding amino acid 
sequences of the proteins coded for by those nucleotide 
sequences are displayed. The nucleotide sequence of the 
Hif-II-206 fragment — the Pst I fragment (790bp) of the 



plasmid DNA isolated using the procedure of Method B as 
described by N. M. Wilkie et al., Nucl . Acids Res. , 7, 
pp. 859-77 (1979) from culture HcIF-G — was determined 
using the standard procedure of A. M. Maxam and W. Gilbert, 
supra . The sequencing strategy employed is shown in 
Figure 17 . 

Restricted DNA (usually about 10 yg) was 5* 
terminally labelled as described by N. Mantei et al./ 
Gene , 10, pp. 1-10 (1980). Labeled fragments were 
cleaved with a second restriction enzyme, the products 
separated by electrophoresis through a 5% polyacryl amide 
gel in Tris-borate-EDTA buffer (A. C. Peacock' and C. W. 
Dingman, Bioch. , 6, pp. 1818-27 (1967)), extracted from 
the gel /( and purified as described .by W. Muller . et . al . ,. 
J. Mol. Biol. , 124, pp. 343-58 (1978). 

Referring now to Figure 17, the various fragments 
for sequencing were prepared as follows: 

25 and 26 — cleavage of Hif-II-206 with PstI, 
labelling, cleavage with Bgl ll , isolation of a Pst I*- Bgl II 
fragment (257bp) ("25") and a PstI*-BglII fragment (279bp) 
("26"); 

21, 22 and 23 — cleavage of Hif-II-206 with 
PvuII, labelling, cleavage with Bgl ll , isolation of a 
Pvu II*- Bgl II fragment (88bp) ("21"), a Pyu II*- Bgl II 
fragment (176bp) ("22") and a PvuI*-BglII fragment (2l4bp) 
("23"); 

11, 12, 13, and 14 — cleavage of Hif-II-206 
with Bgl ll , labelling, cleavage- with PstI, isolation of a 
Bgl II*- Pst I fragment (279bp) ("14") and a comi grating 
mixture of a Bglll* -PstI fragment and a Bgl l I * - Bgl l I * 
fragment. Cleavage of the mixture with Pvu II and isolation 
of a BqlII*-FstI fragment (257bp) ("13"), a BqlII*-PvuII 
fragment (176bp) ("12") and a Bgl II*-PvuII fragment 
(88bp) ("11" ) . 

27L, 27U, 41, 43, 44 and 45 — cleavage of 
Hif-II-206 with Hinf I , labelling, isolation of precursor 
fragments: Hinf I *- Hinf I* . (113bp) ("27P"), Hinf I* -Hinf I* 
(I46bp) ("28P"), Hinf l*- Hinf l* (159bp) ("30P"), Hinf I*- 
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Hinfl* (397bp) ("31P") and Hinf I*- Hinf I* (1522bp) (»32P M ). 
Cleavage of 28P with Mbo II and isolation of a fragment 
HinfI*-MboII (112bp) ("41"). Cleavage of 30P with MboII 
and isolation of a fragment Hinf I*- Mbo II (126bp) ("43"). 
Cleavage of 3 IP with PstI and isolation of a fragment 
Hinf I* -PstI (151bp) ("44"). Cleavage of 32P with Pst I 
and isolation of a fragment Hinf I*- Pst l (13 9bp) I "45")*. 
Strand separation of 27P to yield two strands ( , '27U" and 
"27L" ). 

All segments, except 27U amd 27L, were seguenced 
on both strands and across the restriction sites that 
served as orgins for sequencing, except for the Bgl ll 
site at position 185. 

From a comparison of the amino acid sequence 
coded for by the two inserts it is apparent that the 
Hif-I 1-206 fragment codes for an interf eron-like protein 
having one less amino acid than does the Hif-2h fragment 
(amino acid 44 (Asp) present in Hif-2h is missing in 
Hif-II-206). Moreover, 10% of the nucleotide positions 
and 17%. of the derived amino acid residues of the two 
fragments are different. 

In addition, when compared to the 3 5 amino acids 
determined for the amino terminus of lymphoblastoid inter- 
feron (K. C. Zoon et al . , Science , 207, pp. 527-28 (1980)), 
the insert Hif-II-206 codes for a protein that differs in 
5 residues from the 35 amino acid residues determined by 
Zoon et al., supra . Therefore, at least three different 
I FN genes of the leukocyte type (a type) must exist — 
Hif-2h fragment, Hif-lI-206 fragment and the gene coating 
for Zoon's I FN. In accordance with the newly proposed 
nomenclature for interferon, hereinafter the proteins 
coded for by these genes will be identified as follows: 
I FN Gene Source Protein 

Hif-2h IFN-al 

Hif-II-206 IFN-a2 

I FN from lymphoblastoid IFN-a3 
cells (Zoon et al. ■ 
( supra ) 



The differences between IFN-crl and IFN-or2 are 
also reflected in their varying activities on human CCL23 
and bovine embryonic kidney (BEK) cells: 



Extract 



Relative I FN Activity* 



CCL23 



BEK 



Ratio 



Hif-II-206 (IFN-a2) 



1.7 



1.0 



1.7:1 



Hif-SN35** (IFN-al) 



0.05 



1.0 



1:20 



Therefore, IFN-:al is about 30 times less active 



-on human cells than IFN-a2 . Yet, they are both about 
equally active on bovine cells. Therefore, IFNs may, in 
addition to their use as antiviral and antitumor or 
anticancer agents in humans, also be useful in treating 
these conditions in cattle. For example, preparations of 
HuIFN-or could be employed in a standard manner ( supra ) in 
treating FMDV and other well known viral infections of 
cattle. This is even more particularly true for IFN-crl 
since its activity on bovine cells is about 20 times 
greater than its activity on human cells. 



* E. coli HB101 containing the hybrid plasmid were 
grown in tryptone medium with shaking to an OD 6 so of 
about 1-2. The cells were harvested, weighed, resuspended 
in 1/100 or 1/20 of the original culture volume and lysed 
by the lysozyme-freeze-thaw method (S. Nagata et al., 
Nature , 284, pp. 316-20 (1980)). The S-30 supernatants 
were tested by CPE reduction assay in microti ter plates. 
Extracts from control cells were negative (<1 I.U./ml). 
Human CCL23 cells, and bovine embryonic kidney (BEK) cells 
(FLOW) were grown in MEM-10% fetal calf serum. Exposure 
to IFN-containing extracts was for 24 h. The cells were 
challenged with an appropriate dilution of Mengo virus 
and stained : 24 h later. Values were estimated visually 
relative to partially purified leukocyte IFN (preparation 
PIF, a gift of K. Cantell) of known titer. This prepara- 
tion was about 3x more active on human than on bovine cells. 

** The comparison of the restriction maps of Hif-SN35 
and Hif-2h suggest that they are polymorphic variants of 
each other ( supra ) . 



Because of the improved yield attained in the 
case of IFN-al with construction C8 ( supra ) , a similar 
construction was made for IFN-a2. Z-pBR322 (Pst )/Hif-II-206 
was cleaved completely with Bsp l and partially with PvuII 
(at PI) (Figure 28) and the 867 bp fragment was isolated 
from a 6% polyacrylamide gel. This fragment was then 
ligated to a 259G bp Pvu II fragment of C8-IFN-al.* The 
resulting hybrid plasmid was used to transform E. coli 
HB101 and the clones screened for I FN activity. One 
clone displaying a high activity was selected and desig- 
nated E. coli HB101 (C8-IFN-a2). 

DNA sequencing of hybrid plasmid C8-IFN-a2 
revealed that it like C.8- IFN-al had a coding sequence 
following the initiator triplet that determined the first 
seven amino acids of 0-galactosidase, a Pro residue 
generated by the fusion and amino acids 16 to 23 of the 
IFN-a2 signal sequence. Therefore, again a fused protein 
containing IFN-a2 is likely to be expressed. 

Minicells transformed with this plasmid gave 
100-200 million units per 1 of I FN or 20000 to 40000 
times higher yields of IFN-a2 than minicells transformed 
with unmodified 2-pBR322 (Pst )/Hif-II-206 . 

A comparison of the relative yields of C8-IFN-al 
(-50 x 10 6 units/liter) and C8-IFN-a2 (-100-200 million 
units per liter) is at first surprising because IFN-a2 
has been shown to be about 30 times more active than 
IFN-al on human cells (supra). However, quantitative 
analysis of the amount of the two proteins made by minicells 
transformed with the two C8 plasmids revealed that in 
C8-:IFN-al about 5 to 6 times more protein than in C8-IFN-a2 
was being made. Therefore, the yields measured on the 
basis of I FN activity had been skewed by the much greater 
amount of protein made in the case of C8- IFN-al. 

Referring now to Figure 26, another construction 
in an attempt to improve- the yield of IFN-a2 is described. 



* C8-IFN-al has three Pvu II restriction sites (Figure 28) 
The 2590 bp fragment is between P x and. P 3 . 



First, an expression plasmid containing the LAC Alu 
fragment was prepared by restricting the known lac promoter 
with Alu l and extending the fragment as shown ( for one 
termini) in Figure 27 with EcoRI linkers (prepared by 
colloborative research). The extended fragment was then 
inserted into pBR322 at the EcoR I site and a small 
EcoRI -EcoRI fragment deleted from the construction.. The 
resulting plasmid, designated 404. in Figure 26 was cleaved 
with Hind i 1 1 and PvuII for insertion of the IFN-or2 contain- 
ing fragment. That IFN-or2 fragment was prepared by 
partial Sau 3A restriction of Z-pBR322 (Pst)/HcIF-II-206 
("206" in Figure 26), extension of the Sau3A fragment 
with Hind i 1 1 linkers (Figure 27) and cleavage with Bsp l , , 
After insertion of this fragment into the Hind i I I- Pvu l I 
cleaved plasmid 404, the resulting plasmid was restricted 
with Hind i 1 1 and EcoRI, treated with Si nuclease to bring 
the LAC. promoter closer to the IFN-a2 gene and religated. 
This construction identified as plasmid LAC-AUG(cr2) has 
the IFN-ot2 DNA sequence under the control of the LAC 
promoter. Moreover, the IFN-a2 sequence immediately 
follows the initiating AUG codon of that promoter (see 
Figure 27). Therefore, at least a portion of the I FN 
produced by these plasmids will be mature I FN, e.g. , I FN 
without any amino acids from the signal sequence. In 
minicells the yield of IFN-ot2 obtained with plasmid 
LAC-AUG(a2) was 5-10 million units per liter. 

Another IFN-a2 construction based on similar 
linking principles has also been made. Here, the penicil- 
linase expression control sequence of pBR322 has been 
connected via an AUG initiator codon to the IFN-or2 gene 



from Hif-II-206.* This plasmid designated as p-lac-AUG(or2 ) 
when used to transform host cells affords the production 
of IFN-a2 without fusion to other protein sequences. In 
minicells, yields of 50-100. million units per liter have 
been observed. This plasmid is the most preferred plasmid. 
for use in accordance with this invention. It is also 
preferred for use with the other IFN-a genes disclosed 
herein. The preferred host in accordance with this inven- 
tion is E. coli DS410 (ara azide TonA lac y Tsx min a min 
b gal \ xyl step R ). This strain has been deposited together 
with an example of the preferred plasmid p-lac-AUG(a2.) as 
HcIF-K. 

Other constructions using various promoter 
sequences, ribosome binding sites, Shine-Dalgarno ' sequences 
and DNA sequences between the promoter and the AUG initiator 
codon and using various of the IFN-cr genes disclosed . 
herein can also be constructed using similar methods and 
principles. These constructions are of course envisioned 
by this invention and are within the scope thereof. 

HYBRID MOLECULES OF TFN-crl AND IFN-or2 

A number of hybrid molecules of IFN-orl and 
TFN-a2 have been constructed. Surprisingly these hybrid 
constructions have quantitatively different properties 
and activities as compared either of their parents — 
IFN-al or IFN-a2. 



* : This construction could most effectively be made by 
partial digestion of pBR322 with MboII, treating with SI, 
attaching Eco RI linkers as before and reinserting the 
fragment into the Eco RI site of pBR322 and deleting one 
Eco RI site. The resulting plasmid ("p-lac-AUG plasmid") 
could then be combined with an Si treated (mild) Hind i I I 
linker- Bsp I fragment of 206 described previously. After 
cleavage with Eco RI and treating with Si and phosphatase, 
the expression plasmid 0-l"ac-AUG(a2 ) is isolated. 
Constructions with other genes could be done in a similar 
fashion by merely utilizing the constructed 0-lac-AUG 
plasmid for insertion of other genes or constructions or 
more preferably by using the Sau 3A site in plasmid 
p-lac-AUG(a2 ) itself.. 



Referring now to Figure 28, a schematic outline 
of the construction of four of 'these hybrid molecules is 
displayed. For convenience these hybrid molecules are 
designated as plasmids I, II, III and IV. In these 
constructions digestions with restriction enzymes (obtained 
from Biolabs, with the exception of Bsp l , a gift of ^ 
Dr. Kiss) were carried out essentially as recommended by 
the suppliers. Partial DNA cleavages were carried out 
with decreased amounts of enzyme. After heat inactivation 
(65°C, 30 min) of the restriction enzyme, the samples 
were adjusted to 50 mM Tris-HCl (pH 8) and if required 
calf intestine alkaline phosphatase (Boehringen) (1 vol 
per \jg DNA) were added. After -30 min at 37°C, .the samples 
were extracted with phenol and ether. In most cases the 
DNA fragments were separated on low temperature gelling 
agarose (0.8%). For ligation the fragment containing gel 
pieces (about 20 pi each) were melted at 65°C, cooled to 
37 °C and 20 U per \il T4 DNA ligase was added. The mixture 
was kept at 15°C for 16 hrs and ligation occurred in the 
solidified gel (H. Lehrach, personal communication (1980)). 
One tenth vol of 100 mM Tris-HCl (pH 7.5), 100 mM CaCl 2 , 
100 mM MgCl 2 was added, the sample heated 5 min at 65°C 
and cooled to 37°C. The samples were. then added to 
Ca +2 - treated mini cells, incubated at 0°C for 20 min, 
heated 1 min at 42 °C and 10 min at 20 °C and 1 ml tryptone 
medium added. After incubation for 60 min at 37°C, the 
cultures were plated on to agar plates containing the 
appropriate antibiotics. All of the plasmids were charac- 
terized by nucleotide sequence analysis across the joint 
in the I FN sequence. 

Hybrid molecule I, an a-l(PyuII )a-2 hybrid, was 
constructed by partially cleaving C8-IFN-al ( supra ) (C8-al 
in Figure 28) with PyuII, dephosphorylated, cleaved with 
PstI and the Pstl-Pvuil (P 2 ) 1346 bp fragment isolated. 
This fragment was ligated to. a 2135 bp PstI ( a)-PyuI I (F 2 ) 
fragment prepared by totally cleaving C8-IFN-cr2 (supra) 
(C8-a2 in Figure 28) with PvuII and partially cleaving it 
with PstI. 
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Hybrid molecule II, an a-l(B2l.II )or-2 hybrid, 
was constructed by cleaving hybrid molecule I with Bql ll . 
After dephosphorylation, the large Bql ll fragment was 
isolated and ligated to the small Bgl H fragment of 
5 C8-IFN-a2. After cloning, the hybrid plasmid having the 
small Bql ll fragment in the correct orientation was 
identified by restriction analysis. 

Hybrid molecule III, an a-2(PvuII )a-l hybrid, 
was constructed by partially cleaving C8-IFN-crl with 
10 Pvu II , dephosphorylating, cleaving with Ava l and isolating 
the 1686 bp Pvul I (P 2 )-AvaI and 3233 bp PvuII (Pi ) -Aval 
fragments. These fragments were then ligated to the 
. 300 bp PvuII (Pi ) -Pvul I (P. 2 ). fragment of the HcIF-II-2.06 . , 
( supra ) (SN206 in Figure 28) and the plasmid containing 
15 the small Pvu II , fragment identified by assaying trans- 
formed E. coli strains for IFN-a activity. 

Hybrid molecule IV, an a-2 (B3III )a-l hybrid, 
was constructed by cleaving C8-IFN-ctl with Bql ll and Ava l 
and the 1776 bp fragment isolated. This fragment was 
20 then ligated to the 3543 bp Bgl ll- Ava l fragment of hybrid 
molecule III. 

The biological activities of the different 
interferon species relative to each other were also 
determined. Cultures of minicells (DS410) transformed 
25 with the various plasmids were grown and the bacteria 

harvested by centrifugation, washed with PBS, suspended 
in PBS (about 1/20 of the original vol), incubated for 
60 min at 0°C with 1 mg/ml lysozyme, 10 mM EDTA, freeze 
thawed four times, sheared by passing five times through 
30 ' a syringe and cleaved by centrifugation. 

The activities on human, mouse, guinea pig and 
bovine cells were as follows: 

Relative I FN Activity 
Protein Source Human (FS4) Bovine (BEK) Mouse (L) Guinea Pig 
C8-IFN-a2 1 1 0-01 0.03 

C8-IFN-al 0.03 1 0.3 0.03 

Hybrid I 0.001 1 0.003-0.008 0.003-0.01 

Hybrid II 0.001 1 0.001 0.01 



35 
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Hybrid III 1 • 1 1 0,3 

Hybrid IV 0.1 1 2 0.1-0.005 

Negative Control 

Surprisingly, all of the interferons have about the same 
5 activity on bovine cells, yet IFN-al and the two hybrid 

IFN's (I and II) which have the amino terminal moiety o.f 
IFN-al have about a 10- to 1000-fold lower activity on 
human cells than IFN-a2 and the two hybrid IFN's (III and 
IV) having the amino terminal moiety of IFN-a2. It is 

10 even more striking that the two hybrid IFN's (I and II) 

with the amino terminal part of IFN-al have more than a 
10-fold lower activity on human cells than IFN-al itself. 
Yet, one of the hybrids (III) with the amino terminal 
part of IFN-a2 has about the same, activity on human cells 

15 as IFN-a2. 

IDENTIFICATION OF 
CHROMOSOMAL GENES FOR IFN-a 

A collection of hybrid phage derived from 
fragments of fetal human chromosomal DNA which had been 

20 generated by partial cleavage with Haelll and Alul , and 

joined with EcoRI linkers to k Charon 4A arms has been 
prepared by R/ M. Lawn et al., Cell , 15, pp. 1157-74 
(1978). This gene bank was screened by an "in situ" 
procedure (W. D. Benton and R. W. Davis, Science, 196, 

25 pp. 180-82 (1977); T. Maniatis et al . , Cell , 15 , pp. 687- 

701 (1978)) using as a probe the 32 P-labelled IFN-alcDNA 
insert excised from pBR322 (Pst )/Hif-2h. Sixteen hybridi- 
zation-positive phage clones were isolated from 240,000 
plagues by repeated plaque purification (T. Maniatis 

30 et al., supra ). Ten of the hybrid phage DNA preparations 

were cleaved with Hind i II, Tad , Hhal , BamHI , EcoRI and 
Bglll , respectively, and the fragments separated by 
electrophoresis on an agarose gel, transferred to a 
Millipore membrane (E. M. Southern, J . Mol. Biol. , 98, 

35 PP- 503-17 (1975)) and hybridized with the 32 P-labelled 



Hif-2h cDNA insert. Figure 18 summarizes the results in. 
the form of partial restriction maps and various tables. 
As displayed there, for each hybrid phage DNA preparation 
at least a few characteristic restriction sites were 
established and the region(s) hybridizing to the IFN-orl 
gene probe delineated (black arrow).'* 

Referring now to Figures 18 and 24, it can.be 
seen that clones chr-3 and chr-26 may represent DNA 
segments which overlap over much of their length because 
they have several EcoRI and Hind i I I fragments in common. 
In addition, the hybridizing portion of chr-1 and one. of 
the hybridizing portions of chr-10 may be the same because 
the Hindi I I -Hindi I I and Eco RI - EcoR I fragments, which 
.hybridize with the Hif-2h probe have the same length 
(3.2kb and 0.95kb, respectively). It also appears that 
the "right-hand" hybridizing portion of chr-16 (labelled 
"r" in Figure 18) may be identical with the hybridizing 
portion of chr-3 5, although orientated in the opposite 
direction, inasmuch as each of the two clones yields a 
1.4kb Bgl ll- Bgl ll fragment and a 2kb EcoRI -EcoRI fragment 
which hybridize with the Hif-2h cDNA probe. Therefore, 
most likely the chr-16 and chr-35 inserts overlap. 

Accordingly, it appears that the 13 hybridizing 
portions of the 11 hybrid chromosomal DNAs fall into -not 
less than 10 distinct classes — chr-1, chr-3, chr-12, 
chr-13, chr-16 (left hand, labelled "1" in Figure 18) 
chr-26, chr-3 0, chr-35, chr-19 and chr-27. 

Referring now to Figure 24, the overlapping of 
chr-1, chr-3, chr-10 'and chr-26 and that of chr-16 and 
chr-35 are displayed. 

The above data suggests that the genome of an 
individual human contains not less than 10 different DNA 
sequences that cross-hybridize to Hif-2h. This conclusion 
is reinforced by the fact that the proportion of fragment 



* The shaded area on chr-16 in Figures 18 and 24 
represents a sequence which hybridizes weakly to.Hif-2h 
cDNA but did not display R-looping. 



Hif-2h-related sequences detected in the clone bank is 
about 1 in 16,000. Assuming a value of 3 x 10 9 bp for 
the haploid human genome, the expected value for a single 
gene copy with an average DNA fragment size of 16kb (the 
average value of the clones examined) is about 1:190,000. 
Therefore, the frequency of Hif-2h related fragments is 
12 times higher than expected for a single gene. 

In comparison to these data, when Lawn et al 
( supra ) screened 300,000 plaques from the. same gene bank 
with a p-globin cDNA probe, only 2 positive clones were 
identified — the expected value being 1.6:300,000. 
Therefore, there may be 10-15 distinct chromosomal DNA 
segments in the human genome that cross-hybridize to the 
Hif-2h fragment or IFN-orlcDNA. 

FURTHER CHARACTER I Z AT I ON OF Hif-chr35 

As an illustration only, the hybridizing sequence 
of chr-35 ("Hif-chr 35") was further characterized. It 
is to be understood that the hybridizing portions of the 
previously described chromosomal hybrid phages could 
similarly be characterized and handled without departing 
from the scope of this invention. 

The hybridizing portion of chr-35 ( "Hif-chr35" ) 
(and the right-hand segment of Hif-chrl6 to which it is 
most likely identical, supra ) is the only hybridizing 
chromosomal DNA portion with a Bgl ll site. Since IFN-crl 
and IFN-a2 cDNA's have 1 and 2 Bgl ll sites, respectively, 
within their coding sequences, it seems likely that 
Hif-chr35 is a counterpart to one of the two previously 
cloned interferon genes. Hif-chr35's strong hybridization 
to the 3 f terminal Hif-2h cDNA fragment (containing only 
the 3 1 nbh-coding region) compared to the weaker hybridi- 
zation of the other of the chromosomal DNAs to this probe 
supports a likely correspondence of Hif-chr35 and Hif-2h 
(F.Kafatos et al . , Ffoc. Natl. Acad. Sci USA , 74, 
pp. 5618-22 (1977) ) . 



To analyze the Hif-chr35 fragment further, a 
Hind lll- BamH I fragment was excised from chr-35. This 
fragment (3.4kb) contains the hybridizing portion 
( "Hif-chr35" ) of chr-35. This fragment was subcloned 
into the PstI site of pBR322 using the well-known dC-dG 
tailing procedures (L. Villa-Komarof f et al., supra ) and 
E. coli HB101 transformed with the resultant recombinant 
DNA molecule using well-known procedures (e.g., S. Nagata 
et al., Nature , 284, pp. 316-20 (1980)). 

Clones of these trans formants were screened by 
in-situ colony hybridization (D. Hanahan and M. Meselson, 
Gene , 10, pp. 63-67 (1980)) with the 32 P-labelled Hif-2h 
fragment ( supra ) and. plasmid DNA .— Z-pBR322 (Pst )./HchrIF- 
35HB "HchrIF-35HB" — was the separated from the positive 
clones (N. M. Wilkie et al., Nucleic Acids Res. , 7, 
pp. 859-77 (1979); Method B). The orientation of the 
hybrid insert "HchrIF-35HB fragment" in the plasmid with 
respect to the p-lactamase gene of pBR322 was determined 
by Eco RI cleavage and sizing of the resulting fragments. 
The insert orientated to coincide with that of ^-lactamase 
was designated a and the opposite orientation p. 

Cultures of these positive clones were grown to 
an apparent OD 6 so = °- 8 and bacteria harvested and 
lysed by the lysozyme-freeze-thaw method described in 
S. Nagata et al., supra . Seven of the 10 clones examined 
showed IFN-ff activities of 75 to 500 units/g of cells 
- (cytopathic effect reduction assay). 

The DNA insert of one of these 7 IFN-producing 
clones — E. coli HB iOl ( Z-pBR322 (Pst )/HchrIF-35HBa ) — 
was further characterized by restriction analysis and 
nucleotide seguence determinaton. Plasmid DNA (HchrlF- 
35HBa ) was prepared from the clone as described previously 
and the restriction sites determined by Smith-Birnstiel 
mapping (H. 0. Smith and M. L. Birnstiel, Nucl . Acids Res . , 
3, pp. 2387-98 (1976)): HchrIF-35HBcr was digested with 
EcoRI, labelled at the 5 1 termini and digested with Bgl ll 
(and Pst I to cleave an undesired fragment of about Ikb). 
The 1.04kb EcoRI -Ball I (3 1 proximal) and the 0.96kb 



Eco RI - Bgl ll (5 1 proximal) fragments were . isolated by 
agarose gel electrophoresis as described by A. C. Peacock 
and C. W. Dingman, Biochemistry , 6, pp. 1818-27 (1967)). 
Both fragments were partially cleaved with Hinf I , Bsp l 
and Mbo II respectively and the products separated on a 
1% agarose gel in Tris-acetate buffer (ph 7.8) containing 
1 pg/ml ethidium bromide. After staining, the radioactive 
bands were visualized by autoradiography. The BstNI and 
HqiA I sites were similarly determined on the 1.04kb 
(3 ! proximal) fragment. The results of the analysis is 
shown in Figure 19. . 

For nucleotide sequence determination, HchrlF- 
35HBa was cleaved with various restriction enzymes, the 
products separated by electrophoresis through a 5% poly- 
acrylamide gel in Tris-borate-EDTA buffer (A. C. Peacock 
and C. W. Dingman, supra ) and extracted from the gel and 
purified as described by W. Miller et al., J. Mol. Biol. , 
124/ pp. 343-58 (1978). 

The sequencing strategy employed is depicted in 
Figure 19 and described as follows: 

1 and 2 — cleavage of HchrIF-35HBcr with Bgl ll, 
labelling, cleavage with Eco RI and Pst I and isolation of 
a BglII*-EcoRI fragment (940bp) ("1".) and a B2lII*-EcoRI 
(360bp) ("2"); 

3 and 4 — cleavage of HchrIF-35HBa with Eco RI , 
labelling, cleavage with Bsp l and isolation of an EcoR I *- 
Bsp l fragment (680bp) ("3 lf ) and an EcoRI *-BspI , fragment 
(880 bp) ("4"); 

5, 6., 7 and 8 — cleavage of HchrIF-35HBor with 
Pvu II , labelling, cleavage with Bgl ll and Eco RI and 
isolation of Pvu II*-EcoRI fragment (780bp) ("5"), a 
PvuII*-BqlII (215bp). ("6" ), a PvuII*-BglII fragment 
(90bp) ( ,, 7 ,, ) / and a PvuII*-EcoRI fragment (290bp) ("8"); 

9 and 10 — cleavage of HchrIF-35HBa with 
Eco RI, isolation by 1% agarose gel electrophoresis in 
Tris-borate EDTA buffer of a 1300 bp EcoRI -EcoRI fragment, 
further cleavage with Hinf I and isolation of a Hinf I- Hinf l 
fragment (450bp), and a Hinf I- Hinf I fragment (180bp). 
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Labelling, of the larger Hinf I- Hinf I fragment and cleavage 
with Mho I I permitted isolation of a Hinf I*- Mbo II (190bp) 
("9"). Labelling of the shorter Hinf I- Hinf I fragment and 
cleavage with Ava il permited isolation of a Hinf I*- Ava II 
5 fragment (150bp) ("10" ); 

11 — cleavage of Hchr IF-35HBcr with MboII, 
labelling, cleavage with Bgl ll , and isolation of a Mbo. II*- 
Ball I fragment (465bp) ("11") 

12, 13 and 14 — cleavage of Hchr I F-3 5HBa with 

10 ' Bsp l and Bgl ll , isolation by agarose electrophoresis as 

above of a 1200bp Bsp l- Bgl ll fragment and (a) cleavage 
with HqiA I, labelling, cleavage with Mbo II and isolation 
of an - HqiA I*- Mbo II fragment (300bp) ("12" ) and an Hqi AI*- 
MboII fragment (360bp) ("13"), or (b) cleavage with 

15 BstNI , labelling, cleavage with Eco RI and isolation pf a 

BstNI *-EcoRI fragment (380bp) ("14"). " 

The various fragments were sequenced by the 
Maxam-Gilbert procedure ( supra ). All fragments were 
sequenced on both strands and across the restriction 

2Q sites that served as origins for sequencing. 

A comparison of the nucleotide sequence of the 
coding region of HchrIF-35HBa and that of Hif-2h (coding 
region) (Figures 8-10 compared to Figures 20-23) reveals 
that they are identical. In particular, it is surprising 

25 that there is no indication of the presence of introns 

within the coding sequence of the Hchr IF-3 5HBa fragment, 
- i.e., between the Hinf I site in the 5 f non-coding region 
and the Eco RI site in the 3' non-coding region. 
Thus, no intron could be detected in the chromosomal 

30 sequence corresponding to mature IFN-ormRNA. 

FURTHER CHARACTER I Z AT I ON OF 
Hif-chr 26 AND Hif-chr 3 



The gene-containing segments of chr-3 and 
chr-26, which appear identical by heteroduplex analysis 
but differ in at least one Bgl ll restriction site were 
examined by nucleotide sequencing. Five nucleotide 



differences in 725 base pairs were founds. Only two of 
these appear in the coding sequences. Since not only the 
genes, but at least 3.5 Kbp preceding and 6.0 Kbp follow- 
ing them formed a perfect heteroduplex and because of the 
relatively low sequence divergence which entails only 2 
amino acid changes, it appears that Hif-chr3 and Hif-chr26 - 
are allelic forms of the same gene. These are designated 
IFN-a4a (Hif-chr3) and IFN-a4b (Hif-chr26). The nucleotide 
sequence and corresponding amino acid sequence of IFN-or4b 
determined by conventional sequencing techniques described 
previously is displayed in Figures 29-32. 

A comparison of Figures 29-32 with Figures 8-10, 
12-16 and 20-23 reveals that the proteins, encoded by each, 
of the sequences differ from each other in about 15% of 
their residues. This divergence is typical for products 
of non-allelic genes which have diverged 20-90 million 
years ago. 

Expression Of Hif-chr35 In Mouse Cells 

Plasmid 2-pBR322 (Pst )/HchrIF-35HBa ( supra ) was 
used as a source of a Hif-chr35 fragment for expression 
in mouse cells. The plasmid was restricted with PstI and 
treated with 5 1 exonuclease' to remove the 5 1 dG tails. 
This fragment was then inserted into a 5' dG- tailed Kpn l 
fragment of a plasmid prepared by joining the BamH I- BamH I 
fragments of pBR322 and polynoma DNA. The resulting 
vector was used to transform mouse 3T3 cells using the 
calcium phosphate technique (N. Mantei et aj. , Nature , 
281 pp. 40-46 (1979) ) . These transformed cells are 
designated for convenience Mouse 3T3 (polynoma-Hif-chr35 ) . 
After 20-40 hours, assays revealed an IFN-cr activity of 
300 units/ml of IFN-a on human cells and about 3000 
units/ml of IFN-a on bovine cells. 

It should of course be understood that the 
nucleotide sequences depicted in Figures 8-10, 12-16, 
20-23 and 29-32 do not take into account any modifications 
to the nucleotide sequences such as mutation, including 



single or multiple, base " substitutions , insertions, 
inversions or deletions which may have already taken 
place or which may subsequently be employed. Moreover, 
the sequence also does not take into account the possible 
substitution of other codons coding for the same amino 
acid as a codon depicted in these figures. Therefore,^ it 
should be understood that such modified sequences as code 
for polypeptides displaying an immunological or biological 
activity of IFN-a are also within this invention. 

In addition, it is to be understood that the 
amino acid sequences depicted in Figures 8-10, 12-16, 
20-23 and 29-32 do not take into account any modifications 
to the polypeptides caused by their interaction with 
in vivo or in vitro agents e.g. in vivo glycosolat'ion 
enzymes. Therefore, it must be understood that fragments 
and derivatives of these polypeptides that display an. 
immunological or biological activity of IFN-a are also 
part of this invention. 

PRODUCTION OF POLYPEPTIDES DISPLAYING AN 
IMMUNOLOGICAL OR BIOLOGICAL ACTIVITY OF 
INTERFERON IN BACTERIAL HOSTS 

Since the cytopathic effort reduction assay 
(W. E. Stewart II and.S. E. Sulkin, S. E. Proc. Soc. Exp. 
Biol. Med. , 123, pp. 650-53 (1966)) can detect minute 
amounts of I FN — less than one active molecule per 
bacterial cell — lysates of E . coli HB101 infected with 
the ten hybrid k phages, described previously, were 
assayed for the presence of I FN. Seven of the eleven 
phages (all except chr-10, chr-12, chr-19 and chr-27) 
gave lysates containing I FN activity ranging from 3 to 
50 units/ml. In the case of chr-10 and chr-12, the 
hybridizing (to Hif-2h) Hind lll- Hind lll or EcoRI-EcoRI 
fragments, subcloned into the PstI site of pBR322, as 
described previously, expressed IFN-a activity in E. coli . 
Since E. coli is believed to be incapable of splicing 
mRNA (O. Mercereau-Pui jalon and P. Kourilsky, Nature , 



279, pp. 647-49 (1979)), these IFN-cr chromosomal genes 
most likely do not contain introns in their coding region. 



FINAL CONCLUSIONS 

We have isolated a set of recombinant DNA 
5 molecules containing cDNA . prepared from poly (A) RNA from 
Sendai virus-treated (induced) human leukocytes, represen- 
tatives of which have the following properties: 

(1) They hybridize to poly (A) RNA from induced 
but not from non- induced human leukocytes. 
10 (2) They hybridize to IFN-amRNA as shown by 

their capacity to select this RNA from a mixture of RNAs, 
and by their capacity to inhibit, (reversibly) translation 
of interferon mRNA in the hybrid arrested translation 
assay. 

15 (3) E. coli containing certain members of the 

set produce a compound with the following properties: 

(a) It is sensitive to trypsin 

(b) It exhibits IFN-a activity in a human cell 
system and only slight activity in a mouse cell swb 



20 3Dd&b Xt a molecular Y eight between ^pH 

and ao'OQO (19,388 based on the nucleotide sequencing of 
Figs. 8-10) 

(d) The IFN-a activity is specifically inhibited 
by antibody to human leukocyte interferon. 

25 (4) The DNA inserts of the hybrid plasmids of 

this invention are able, in addition to their ability to 
select IFN-amRNA from a mixture of RNAs, to select IFN-crDNA 
from mixtures of various sources including cDNAs and from 
hybrid phage gene banks of chromosomal DNA. 

30 (5) A number of different chromosomal genes 

for IFN-a exist- It is unexpected that these genes lack 
introns and permit direct expression of interferon and 
interferon- like polypeptides in appropriate hosts. 

(6) At least three of. the nucleotide sequences 

35 of the DNA inserts of these recombinant DNA molecules are 
different and suggest the existence of at least three 
non-allelic genes for IFN-a. 



(7) The proteins coded for by these three 
nucleotide sequences are different from the 35 amino 
acids determined from authentic lymphoblastoid interferon. 

(8) Hybrid proteins prepared for various 

5 combinations of IFN-a gene segments display quantitatively 
different properties than each other or their parents and 
proteins having additional amino acids fused to IFN-a or 
proteins comprising IFN-a without a portion of its amino 
terminal sequence display I FN activity. 
10 These properties demonstrate that the recombi- 

nant DNA molecules described by this invention contain at 
least a part of the coding sequence for human leukocyte 
interferon and that some of these plasmids lead to expres- 
sion in E. coli of a polypeptide with an immunological or 
15 biological activity of human leukocyte interferon. It 
should also be evident that the polypeptides disclosed 
herein may be fragmented, modified or derivatized, as is 
well known in the protein art, without departing from the 
scope or disclosure of this invention. 
20 Micro-organisms and recombinant DNA molecules 

prepared by the processes described herein are exemplified 
by cultures deposited in the culture collection Deutsche 
Sammlung von Mikroorganismen, in Go ttingen , West Germany 
on January 7, 1980, and identified as HcIF-A to E: 
25 A: E. coli HB101 (Z-pBR322 (Pst )/HcIF-4c ) 

B: E. coli HB101 (Z-pBR322 (Pst)/HcIF-2h) 
C: E. coli HB101 (Z-pBR322 (Pst)/HcIF-SN35 ) 
D: E. coli HB101 (Z-pBR322 (Pst )/HcIF-SN42 ) 
E: E. coli HB101 (Z-pKT287(Pst)/HcIF-2h-AH6 ) 
30 These cultures were ass.igned accession numbers DSM 1699- 
• 1703 , respectively . 

In addition, micro-organisms and recombinant 
DNA molecules prepared by the processes described herein 
are exemplified by cultures deposited in the culture 
35 collection of the American Type Culture Collection, 
Rockville, Maryland on March 27, 1980 and identify 
$ < ffi?£\l 1 fo B ' assig 11 ^ A 100 accession numbers 

miu 3/634! respectively: 




G: E. coli HB101 (Z-pBR322 (Pst )/HcIF-II-206 ) 
H: ' E. coli HB101 ( Z-pBR322 (Pst )/HcIF-SN35-AHL6 ) 
Other micro-organisms prepared by the processes described 
herein are exemplified by cultures deposited in the 
culture collection Deutsche Sammlung von Mikroorganismem, 
in Gottingen, West Germany on October 1, 1980 and identi- 
fied as HchrlF-A through J, and assigned accession numbers 
DSM 1914-1923: 

A. subcloned Hind i I I fragment of chr 3 in 

E. coli HB101; 

B. subcloned EcoRI fragment of chr 12 in 

E. coli HB101; 

C. subcloned Hind i 1 1 fragment of chr 12 in 

E. coli HB101; 

D. subcloned Eco RI fragment of ..chr 13 in 

E. coli HB101; 

E. subcloned EcoR I fragment of chr 23 in 

E. coli HB101; 

F. subcloned Hind i I I fragment of chr 23 in 

E. coli HB101; 

G. subcloned EcoR I fragment of chr 26 in 

E. coli HB101; 

H. subcloned Hind i I I fragment of chr 26 in 

E. coli HB101; 

I. subcloned Hind lll/ BamH I fragment of chr 35 

in E. coli HB101; 

J. subcloned BamH I fragment of chr 35 in 

E. coli HB101, 

Finally, micro-organisms prepared by the 
processes described herein are exemplified by cultures 
deposited in the American Type Culture Collection, 
Rockville, Maryland, on December 15, 1980 and identified 
as HchrlF-K through HchrlF-Q and HcIF-I through HcIF-K, 
and assigned ATTC accession numbers 31760-31769, 
respectively: 

K. subcloned Tac-Tac fragment of chr 23 in 

E. coli HB101. 



in i^*™ 1 ^^ Hinant-Bindl" fra^ent of 
chr 10 . ^^^-^xx-Bain frao»e„t of chr 2. 
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chr 30 in £ragBe »t of chr 13 i» 

T' «*el— BslU-Tac fragment of chr 1. 1 i» 
TlF-I" E. coli DS410 (C8-IFN-02) 

o£ embodiments of this rnventro- it P^ ^ 
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this invention. Therefor . fcy ^ 

the scope , of this invention is « 

example- 



